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Docket No. P3250 

5 

METHODS, APPARATUS AND COMPUTER PROGRAM 
PRODUCTS FOR FORMULATING CULTURE MEDIA 

10 

FIELD OF THE INVENTION 

This invention relates to methods, apparatus and computer program 
products for formulating culture media, and more particularly to methods, 
apparatus and computer program products for identifying culture media 
15 components. 

BACKGROUND OF THE INVENTION 

Media used to grow cultured cells for both industrial and clinical 
applications are usually chemically undefined or, at best, semi-defined. It would 

20 be advantageous to employ chemically-defined media in cell culture systems; 
however, most attempts to grow cultured cells exclusively in chemically-defined 
media have been unsuccessful and result in a high frequency of cell death and/or 
poor cell performance. These efforts have failed, at least in part, because those 
components of undefined media that promote cell viability and performance 

25 remain uncharacterized. Thus, it would be desirable to develop improved 
methods of identifying media components that support and enhance cell viability 
and performance. 

Hydrolysates are the most common undefined substance used in 
bacteriological media today for both clinical and fermentation applications. 
30 Current media optimization frequently starts with these undefined substances. 
Hydrolysates are used to replace serum (another undefined substance) in 
mammalian culture (Saha and Sen, (1989) Acta Virol. 33:338). 

The inclusion of tissue and protein hydrolysates in bacterial growth media 
has been practiced since the late ISOOs. Retger, (1927) J. Immunology 13:323, 
35 provided some of the earliest details on preparing and combining hydrolysates. 
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Retger's data demonstrated that best toxin yields from Corynebacteria diptheriae 
were obtained with medium formulated with hydrolysates containing a lower 
percentage of full-length proteins and a higher concentration of peptides. Kihara 
et al., (1952) J. Biol. Chem. 197:801, also found that cultured cells performed 
5 better in medium containing peptides as compared with the constituent amino 
acids. 

There are several drawbacks associated with using digests in cell culture 
media, for example, the range of peptide sequences available for incorporation 
into culture media is limited by the starting substrate and enzyme or acid used in 

10 the digestion. Many of the currently-available digests are difficult to reproduce, 
with significant lot-to-lot variation being commonplace. Digests of tissue obtained 
from a slaughterhouse are often used as components of cell culture media. 
These digests are among the most difficult to reproduce since the starting 
material and ratios of the starting substrate vary. Casein digests are more 

15 reproducible because milk is a more homogenous source of protein than 
slaughterhouse tissue. However, the range of peptide sequences generated by 
an enzymatic or acid digestion of casein is small. Most media manufacturers 
blend these two types of digest to yield a medium that provides better results 
than can be achieved with either digest used alone. Unfortunately, the use of 

20 blends adds another level of complexity to the manufacturing process and is also 
a source of lot-to-lot variability. 

Moreover, both hydrolysates and sera are problematic for pharmaceutical 
applications, since each potentially harbors pathogens. The undefined nature of 
hydrolysates and sera also leads to problems in manufacturing. To illustrate, the 

25 high molecular weight components found in both types of undefined substances 
create additional downstream processing costs. Furthermore, the undefined 
nature of hydrolysates and sera leads to lot-to-lot variability. Previous attempts at 
developing chemically-defined media, however, have generally suffered from 
sub-optimal cell performance and unacceptable levels of cell mortality. 

30 Historically, there have been several approaches employed to determine 

the type(s) of nutrients consumed or preferred by cells grown in culture. One of 
the most common practices is post-culture analysis, whereby the spent medium 
is evaluated to identify constituents removed from the medium. This approach 
has only rarely lead to the identification of compounds that can be isolated for 
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use as a medium component or can be employed as a benchmark to monitor 
future hydrolysate or serum lots. In addition, this approach cannot identify 
compounds that function through signaling and are not physically removed from 
the medium. 

5 Zhao et a!., (1996) AppI, Microbiol. Biotechnol. 45:778, compared the 

bacterial growth-stimulating activity of bovine hemoglobin with that of specific 
peptides from a peptic hydrolysate of this protein. A particular peptide fragment 
was demonstrated to promote cell growth of gram negative bacteria to a greater 
extent than the intact protein. No such enhancement was observed when the 

10 constituent amino acids of this active fragment were assayed, leading these 
investigators to suggest that this peptide fragment did not simply act as an amino 
acid supplier, but rather interacted with peptide permeases on the cell 
membrane. This strategy can be employed in other systems as well to provide 
some information as to specific biologically-active peptides produced by 

1 5 hydrolysis of whole proteins. 

More recently, Automated Cell Technologies (ACT; Pittsburgh, PA) has 
suggested that it will use a "combinatorial cell culture" to discover improved 
media for growing hematopoietic stem cells ex vivo. In Vivo: Bus. Med. Rep. 
15:38 (Dec. 1997). Utilizing a 384-well microliter plate and a robotic pipetting 

20 system, ACT proposes to add different growth factor combinations to various 
mixes of culture media in an effort to identify a culture medium that will support 
stem cell growth ex vivo. It is unstated whether chemically-defined or undefined 
media will be used. 

All of the previously-described methods fail to further an understanding of 

25 the physical, chemical or other properties of the medium components that 
contribute to the enhanced cell performance in culture. As a result, these 
methods are inefficient as they fail to provide a means of predicting and 
systematically screening additional lead compounds. 

Most of the research on quantitative structure-activity relationships 

30 (QSAR) have focused on small organic molecules in medicinal and 
environmental chemistry. Peptides have not been studied as much owing to 
difficulties in developing descriptors for amino acid side chains. The earliest 
attempt to quantify amino acids for QSAR was by Sneath, (1966) J. Theoretical 
Biology, 12:157, who developed four semi-quantitative descriptors for amino 
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acids. Later, Hellberg et al., (1987) J. Med, Chem. 30:1126, developed a set of 
principal components, which were derived from twenty-nine measured and 
theoretical properties of amino acids, including molecular weight, isoelectric point, 
nuclear magnetic resonance parameters, logP (hydrophobicity), thin layer 
5 chromatography, and high performance liquid chromatography parameters. 
Principle component analysis led to three principal components, which Hellberg 
et al. called z1, z2 and z3. Theoretically-derived parameters have been 
developed by Norinder, (1991) Peptides, 12:1223, and Cocchi and Johansson, 
(1993) Quant Struct-Act Relat 12:1. In all of these instances, parameters were 
10 developed for the individual amino acids, but none were measured on whole 
peptides. 

Cho et al., (1998) J. Chem, Inf. Comput Sci. 38:259, used a rational drug 
design approach to identify peptides in a targeted virtual library having 
bradykinin-potentiating activity. This group identified virtual peptides predicted to 

15 be enhanced in amino acid building blocks with bradykinin-like activity based on 
analysis of a starting set of peptides known to possess such activity. These 
investigators employed topological indices or physiochemical descriptors of 
individual amino acid building blocks from known leads to design a virtual 
targeted library. Cho et al. also report that computational limitations prevented 

20 complete analysis of all indices for every virtual peptide within their targeted 
library. Cho et al. did not apply their methods to media development. 

Thus, previous attempts to improve culture media have largely relied on ad 
hoc trial-and-error techniques. There remains a need in the art for systematic 
and predictive methods for identifying medium components to improve cell 

25 performance in culture. Moreover, there is a need in the art for high through-put 
methods for identifying medium components. 

SUMMARY OF THE INVENTION 

Development of industrial and clinical culture media has been impeded, in 
30 large part, because of a lack of understanding as to which components of culture 
media promote cell performance. To compensate, currently-utilized media 
generally rely on the inclusion of complex and chemically-undefined substances, 
such as hydrolysates or serum, to support culture performance. The inability to 
identify and predict defined compounds for culture media formulations that will 
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enhance culture performance is problematic, particularly in an industrial 
manufacturing setting where low cost, high performance, and reproducible 
systems are critical. 

It is therefore an object of the present invention to provide methods, 
5 apparatus and computer program products for identifying culture medium 
components and for pairing new culture medium components with established 
media components. 

It is also an object of the present invention to provide methods, apparatus 
and computer program products for predicting the activity (e.g,, biological activity) 
10 of compounds from a compound library. 

It is still another object of the present invention to provide methods, 
apparatus and computer program products for defining a test compound library 
from a larger compound space. 

It is yet a further object of the present invention to provide methods, 
15 apparatus and computer program products for predicting an activity of a peptide 
based on parameters {i.e., descriptors) for whole molecules, constituent amino 
acids, or combinations thereof. 

It is still a further object of the invention to provide more economical and 
rapid methods of identifying compounds with desired activities, e.g., for use in 
20 culture medium, drug discovery and therapy, and/or diagnostics. 

These and other objects, features and advantages are provided by the 
present invention, which utilizes systematic design methods to identify lead 
compounds and to predict the structures of additional leads, e.g., for the 
formulation of culture media. Moreover, the present inventors have discovered 
25 that whole molecule descriptors of compounds (e.g., total dipole moment, 
molecular weight) can play an important (and in some cases, primary) role in 
predicting the activity of a compound and in more efficient exploration of 
compound space. 

The media formulated by the present invention can result in improved 
30 products for diagnostic applications, for example, in products where the media is 
a single component such as plated media, or as a stand-alone product such as 
dehydrated culture media or liquid media. The new formulas may also enhance 
the manufacturing of products cultured in fermenters and bioreactors. In addition, 
the media formulated according to the present invention can provide an improved 
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environment for cell research and drug discovery. In particular, the present 
invention can facilitate the development of culture media to maintain and 
propagate primary cells and cell lines that have been difficult to maintain in vitro 
in traditional media. 

5 According to one preferred embodiment of the invention, a first test library 

of compounds is evaluated to identify compounds within the library with a desired 
characteristic(s) for use as a component of a culture medium. A plurality of 
culture media, each containing a respective test compound(s) from within the first 
test library, is screened by measuring an indicia of a property of the culture 

10 media. Typically, the indicia of the property is measured from a plurality of first 
cell cultures each containing a respective culture medium containing a respective 
test compound. Illustrative properties of the culture media containing the test 
compounds that may be measured according to the present invention include the 
ability to alter (e.g., induce or enhance, alternatively, suppress or inhibit) the 

15 growth of cultured cells, the ability to alter the production (e.g., at the level of 
transcription, translation, post-translational processing, intracellular transport, 
secretion, turnover, and the like) of a peptide(s) and/or protein(s) (e.g., antigens, 
toxins, antibodies, hormones, growth factors, cytokines, clotting factors, and 
enzymes), and the ability to alter the synthesis and/or secretion of other 

20 compounds including but not limited to antibiotics, steroids, carbohydrates, lipids 
and nucleic acids. Additional properties include the ability to alter (as defined 
above) the maturation, differentiation, growth and/or proliferation of cells. 

In particular embodiments, a relationship (e.g., a mathematical 
relationship) is determined between at least one parameter or descriptor (e.g., 

25 physical, chemical, biological and/or topological parameters) of the test 
compounds from within the first test library which are included in the plurality of 
first culture media and the measured indicia of the property. The relationship can 
be used as a predictor to identify additional lead compounds as components of 
culture media that are expected, based on their parameters, to give indicia of the 

30 measured property that satisfy a test requirement. Illustrative parameters that 
may be employed according to the present invention include but are not limited to 
molecular weight, charge, isoelectric point, total dipole moment, isotropic surface 
area, electronic charge index, and hydrophobicity of the whole molecule (e.g., 
peptide, oligonucleotide, carbohydrate, lipid etc.) or individual building block (e.g., 
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amino acid, nucleotide, monosaccharide, triglyceride etc.) in the molecule. Any 
suitable topological parameter known in the art may be employed, such as those 
described by L.B. Kier and L.H. Hall, Molecular Connectivity in Structure-activity 
Analysis, Research Studies Press, John Wiley & Sons, Letchworth England 
5 (1986); M. Johnson et aL, Concepts and Applications of Molecular Similarity, 
John Wiley & Sons, New York (1990); and R. P. Sheridan et aL, (1995) J. Chem. 
Inf. Comput ScL 35:310. The term "parameters" as used herein also 
encompasses the principle components of S. Hellberg et aL, (1987) J. Med. 
Chem. 30:1 126 {e.g., z1 , z2, z3). 

10 A test requirement is determined against which the measured indicia of the 

property are compared. The test requirement may be determined a priori or it 
may be determined before or after operations to determine a relationship 
between the parameter(s) of the first test compounds and the measured indicia of 
the property of the plurality of culture media containing the first test compounds. 

15 The test requirement may be determined so that indicia of the property falling 
above the test requirement are desirable. Alternatively, the test requirement may 
be chosen so that indicia of the property falling below the test requirement are 
preferred. As a further alternative, the test requirement may be such that indicia 
of the measured property that fall within a particular range are preferred {e.g., for 

20 cell growth, it may be advantageous to select for growth above a threshold level 
but below a maximum level, as growth rates above the maximum may adversely 
affect other aspects of cell performance). Alternatively, the test requirement may 
be qualitative, rather than quantitative, in nature. 

The relationship is used to predict the structure of a plurality of untested 

25 compounds each of which, when included as a component of a culture medium, 
is expected to provide indicia of the property that satisfies the test requirement. 
Operations are also performed to identify a second test library containing a 
plurality of second test compounds as components of a plurality of second culture 
media. The plurality of second culture media, which each contain a respective 

30 test compound from within the second test library, are predicted to provide indicia 
of the property that satisfy the test requirement based on the relationship 
determined between the parameter(s) of the first test compounds and the first 
indicia of the measured property. 
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Optionally, and preferably, the plurality of second test compounds will 
include at least one compound that was not among the set of first test 
compounds. It Is also preferred that the plurality of second test compounds 
includes one or more test compounds from within the first test library (/.e., as a 
5 control). 

The steps of measuring indicia of a property of a plurality of culture media 
each containing a respective test compound, determining a relationship between 
the measured indicia and at least one parameter of the test compounds, and then 
determining a follow-up set of compounds may be carried out more than one time 
10 so as to identify a compound(s) that provides a desired indicia of the property 
when included as a component of culture media. Alternatively, if the first set of 
test compounds provides a compound having a desired activity, the screening 
process may end at that point without screening a second, or successive, set of 
test compounds. 

15 The relationship determined between the parameter(s) of the first test 

compounds and the indicia of the measured property can be determined by any 
method for describing the interaction between the activity and structure of 
compounds, for example, by quantitative structure-activity relationships (QSAR), 
nearest neighbor analysis, self-organizing maps, or other machine learning and 

20 statistical techniques. 

In one preferred embodiment, the relationship may be expressed in the 
form of yi = f(Xjj), where Xy denotes a parameter, i ranges from 1 to n, where n 
represents the number of first culture media, j ranges from 1 to d, where d 
represents the number of parameters measured, and y \ represents an estimate 

25 of the measured first indicia of the property. The relationship represented by yi = 
f(Xij) may be a parametric or non-parametric formula. 

According to another preferred embodiment of the present invention, the 
relationship between the parameter(s) of the test compounds and the indicia of 
the measured property is based on a distance function between the parameters 

30 of the tested compounds in the first test library and the parameters of untested 
compounds. The distance function can be expressed as d(xi, X2) between a first 
value of a parameter, Xi, of a first test compound and a second value of the 
same parameter, X2, of a second untested compound. This relationship will 
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assign to culture media containing a second untested compound an estimated 
indicia of the property that corresponds to the measured indicia determined from 
a culture medium containing a first tested compound from the first test library if 
d(xi, X2) < dcutoffi, where dcutoffi is a cutoff distance for the first test compound. In 
5 other words, once a lead compound is identified from the first test library, 
additional lead compounds can be determined based on an assumption that 
compounds that are close in parameter space will exhibit similar activities. 
Accordingly, there is an increased probability that compounds close in parameter 
space will provide similar or better indicia of the measured property. In particular 

10 embodiments of the invention, xi and X2 represent a single parameter or, 
alternatively, a set of parameters, i.e., Xi = Xn, X12, X13, xu - . . Xik and X2 =X2i, X22, 
X23, X24 . . . X2k, where k > 1. One specific example of a method of determining a 
relationship based on distance in parameter space is "nearest neighbor" analysis. 
Other non-limiting and illustrative methods are cluster analysis, self-organizing 

15 maps, and machine learning approaches. See generally, B. B. Ripley Pattern 
Recognition and Neural Networks, Cambridge University Press, New York 
(1996). 

In still another particular embodiment, more than one relationship may be 
used to identify a plurality of second test compounds from within a second test 

20 library that are predicted to give indicia of the property that satisfy the test 
requirement when included as a culture medium component (e.g., both a QSAR- 
type and nearest neighbor-type relationship may be employed). Furthermore, the 
methods of the present invention described hereinabove are preferably practiced 
in an iterative fashion, whereby the lead compounds identified in the second test 

25 library can be used to determine additional lead compounds in a third test library, 
eta, until compounds that provide the desired characteristics are identified. 
Moreover, the relationship determined in each iteration need not be fixed. To 
illustrate, one type of relationship may be determined to identify a set of second 
test compounds, but a different relationship may be determined in subsequent 

30 iterations. 

In addition, the inventive methods can be used to identify a "cocktail" of 
compounds for formulating culture media - the methods described hereinabove 
are not limited to media containing only a single test compound therein. 
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The second test library may be partially or completely co-extensive with 
the first test library {i.e., encompass some or all of the same compounds). 
Alternatively, there may be no common compounds in the first and second test 
libraries. The second test library will generally be smaller in size than the first 
5 test library, and may be a subset thereof. 

The test compounds from the first and second test libraries may be 
selected using any suitable method known in the art. Preferably, the test 
compounds are selected from the compound space based on a space-filling 
design. It is further preferred that the test compounds selected from the 

10 compound space are representative of the entire compound space. Exemplary 
space-filling designs include but are not limited to full factorial designs, fractional 
factorial designs, maximum diversity libraries, genetic algorithms, coverage 
designs, spread designs, cluster based designs, Latin Hypercube Sampling, and 
other optimal designs (e.g., D-Optimal) and the like. The second test library can 

15 be selected from within the first test library or, alternatively, from within the 
compound space. 

As a further alternative, the information obtained from screening a first 
compound space can be used to start screening a second compound space. For 
example, if the first compound space only contained tetrapeptides, the 

20 information and/or lead peptides achieved by screening the tetrapeptide space 
can be used to begin screening a pentapeptide space. As still a further 
alternative, and as described below, the first compound space may contain 
compounds of different sizes. According to this embodiment, in the preceding 
example it may not be necessary to go outside the first compound space if this 

25 space contained both tetrapeptides and pentapetides. 

According to an additional preferred embodiment, the present invention 
provides a method of defining a test library from a larger compound space. 
Preferably, the test compound library will be representative of the compound 
space. In many instances, a compound space of interest may be so vast that it is 

30 computationally difficult to determine a test library therefrom. In particular, the 
space may be so large that it is not computationally feasible to evaluate the entire 
space in forming a test library. According to the present invention, the compound 
space can be reduced by grouping all compound isomers therein as single 
candidate compounds based on at least one global parameter or descriptor (e.g., 
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compounds having the same molecular weight or chemical formula). Thus, each 
set or group of compound isomers can be represented as a respective candidate 
compound. The contraction of the compound space can advantageously simplify 
the process of determining the first test library (or a follow up library), and is 
5 based on the principle that compound isomers may exhibit similar activities 
because of shared whole molecule parameters (e.g., molecular weight, 
lipophilicity, charge as compared with sequence-specific parameters). A test 
library can be selected from this reduced compound space, e.g., by using a 
space-filling design. 

10 Optionally, and advantageously, after reducing the compound space, 

some of compound isomers within the reduced compound space are selected 
and expanded to re-introduce sequence-specific parameter(s) (e,g., z values, 
isotropic surface area, electronic charge index, hydrophobicity) of individual 
amino acids indexed to their relative positions in the sequence. In this expansion 

15 step, preferably less than all of the grouped candidate compounds in the reduced 
compound space are selected and are then re-expanded into their constituent 
compound isomers. A test library can be selected from the re-expanded set of 
constituent compound isomers. In one particular embodiment, the test library 
contains at least one representative compound from each expanded group of 

20 candidate compounds. 

The test library is selected from the compound space using any suitable 
method known in the art. Preferably, the test library is selected based on a 
space-filling design as described hereinabove. 

Another embodiment of the invention relates to test compound libraries 

25 generated by the above-described methods. The test compound library is 
selected from a larger compound space by representing each set of compound 
isomers in the compound space by a single compound. In preferred 
embodiments, the test library is formed from the reduced compound space by 
selecting less than all of the grouped candidate compounds and re-introducing 

30 sequence-specific parameters to re-generate the constituent compound isomers. 
In a more preferred embodiment, the test library is formed by selecting less than 
all of the constituent compound isomers. More preferably still, the test library is 
formed by selecting at least one compound from each of the re-expanded groups 
of constituent compound isomers. 
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According to another particular enribodiment, the inventive methods of 
forming a test library by reducing (and optionally re-expanding) a compound 
space are employed in a method of forming a culture medium (as described 
hereinabove). These methods can be used to determine the first and/or the 
5 second test libraries as well as to determine subsequent follow-up libraries. 

As still a further aspect, the present invention provides a method of 
predicting the activity of a peptide, preferably based on a whole molecule 
parameter(s) of the peptide. A relationship (e.g., a mathematical relationship) is 
determined between a measured indicia of an activity (e.g., a biological activity) 

10 of a plurality of peptides from a test peptide library and at least one parameter 
(preferably at least one whole molecule parameter) of the test peptides. Based 
on the relationship, the indicia of the activity of untested peptides can be 
predicted based on the parameter(s), preferably at least one whole molecule 
parameter, of the untested peptides. This method finds use, e.g., in rational drug 

15 design, in developing culture media, in methods of identifying and/or designing 
peptides that act as receptor agonists or antagonists, and in methods of 
identifying peptides that induce or enhance, or conversely prevent or inhibit, any 
activity of any target protein (e.g., a receptor or enzyme), cell, or nucleic acid 
(e.g., DNA and/or RNA). 

20 In one particular embodiment, the method is used to identify a peptide with 

a predicted indicia of an activity that satisfies a test requirement. According to 
this embodiment, indicia of an activity of a plurality of test peptides from a first 
test peptide library are measured. A relationship is then determined between at 
least one parameter, preferably at least one whole molecule parameter, and the 

25 measured indicia of the activity of the test peptides. Those skilled in the art will 
appreciate that the relationship may also include sequence-specific parameters in 
addition to a whole molecule parameter(s). The relationship can be employed to 
determine a second test library containing a plurality of test peptides that are 
predicted to provide indicia of the activity that satisfies the test requirement. 

30 In particular embodiments, the test peptide library is selected from a larger 

peptide space. In preferred embodiments, the peptide space is collapsed based 
on whole molecule parameters, and optionally re-expanded by re-introduction of 
sequence-specific parameters, prior to selecting the test peptide library 
therefrom, as described above. 
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still another preferred embodiment of the present invention is an 
apparatus for identifying a compound(s) for forming a culture medium. The 
preferred apparatus comprises means for determining a relationship between the 
measured indicia of a property of a plurality of first culture media, each of which 
5 contain a respective first test compound from within a first test library and at least 
one parameter of the first test compounds. The preferred means further 
comprises means for identifying a second test library containing a plurality of 
second test compounds as components of a plurality of culture media, which 
based on the relationship, are expected to provide second indicia of the property 

10 that satisfy a test requirement. A computer program product is also provided for 
controlling the operation of the determining and identifying means and for 
performing numerical calculations to carry out the above-described operations. 

In particular, a preferred computer program product comprises a 
computer-readable storage medium having computer-readable program code 

15 means embodied in the medium. The preferred computer-readable program 
code means comprises means for determining a relationship between measured 
indicia of a property of a plurality of first culture media each containing a 
respective test compound from within a first test library and at least one physical 
parameter of the first test compounds. Computer-readable program means is 

20 also provided for identifying a second test library containing a plurality of second 
test compounds as components of a plurality of second culture media, which 
based on the relationship between the measured first indicia and the 
parameter(s) are expected to provide indicia of the property which meets a test 
requirement relating to the measured first indicia. In addition, computer-readable 

25 program code means is also provided for performing more detailed ones of the 
above-described operations numerically. This embodiment of the present 
invention therefore provides a tool that can more accurately perform library 
screening to identify compounds for forming an improved culture medium. 

Yet another preferred embodiment of the present invention is an apparatus 

30 for defining a test compound library. This preferred apparatus comprises means 
for representing each of a plurality of groups of compound isomers within a 
compound space as a candidate compound, so that the compound space is 
reduced, e.g., to facilitate computational manipulation and sampling thereof. 
Preferably, the apparatus further comprises means for defining a first test library 
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by selecting and expanding less than all of the candidate compounds into their 
constituent compound isomers. It is further preferred that the apparatus 
comprises means for selecting a test library from the expanded compound 
isomers. A computer program product is also provided for controlling operation 
5 of the representing and defining means and performing numerical calculations to 
carry out the above-described operations. 

A preferred computer program product comprises a computer-readable 
storage medium having computer-readable program code means embodied in 
the medium. The preferred computer-readable program code means comprises 

1 0 means for representing each of a plurality of compound isomers from within a first 
compound space as a respective candidate compound. Computer-readable 
program means is also provided for defining a first test library by expanding less 
than all of the candidate compounds into their constituent compound isomers. In 
addition, computer-readable program code means is also provided for performing 

15 more detailed ones of the above-described operations numerically. This 
embodiment of the present invention is therefore advantageous in that it provides 
a tool for defining a test compound library from a larger compound space. 

These computer program products may be realized in whole or in part as 
software modules running on a computer system. Alternatively, a dedicated 

20 stand-alone system with application specific integrated circuits for performing the 
above-described operations may be provided. 

These and other aspects of the invention are set forth in more detail in the 
description of the invention hereinbelow. 

25 BRIEF DESCRIPTION OF THE DRAWINGS 

FIG, 1 is a flow chart illustrating operations performed by methods, 
apparatus and computer program products according to a first embodiment of the 
present invention. 

30 FIG. 2 is a graph of measured indicia of a property of interest determined 

from a plurality of culture media each containing a respective test peptide from 
within a peptide library (peptides on x-axis). 
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FIG. 3 is a flow chart illustrating operations performed by methods, 
apparatus and computer program products according to a second embodiment of 
the present invention. 

5 

FIG. 4A is a graph of measured indicia of a property of interest determined 
from a plurality of culture media each containing a respective test peptide from 
within a peptide library (peptides on x-axis). 

10 FIG. 4B is a graph of the space surrounding a lead peptide from FIG. 4A 

with respect to two parameters: total dipole and hydrophobicity. The concentric 
circles indicate different cut-off points for the distance relationship. 

FIG. 5 illustrates a general hardware description of an apparatus for 
15 identifying a culture medium component from a compound library according to 
the present invention. 

FIG. 6 is a flow chart illustrating operations performed by methods, 
apparatus and computer program products according to a third embodiment of 
20 the present invention. 

FIG. 7 is a flow chart illustrating operations performed by methods, 
apparatus and computer program products according to a fourth embodiment of 
the present invention. 

25 

FIG, 8 is a flow chart illustrating operations performed by methods, 
apparatus and computer program products according to a fifth embodiment of the 
present invention. 

30 DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention will now be described more fully hereinafter with 
reference to the accompanying drawings, in which preferred embodiments of the 
invention are shown. This invention may, however, be embodied in different 
forms and should not be construed as limited to the embodiments set forth 
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herein. Rather, these embodiments are provided so that this disclosure will be 
thorough and complete, and will fully convey the scope of the invention to those 
skilled in the art. Like numbers refer to like elements throughout. 

Referring now to FIG. 1, preferred operations 100 for identifying a 
5 component of a culture medium use the parameter(s) or descriptor(s) (e.g., 
physical, chemical, biological and/or topological parameters) of compounds within 
a compound space to identify those compounds that are predicted to have a 
particular property (e.g., biological activity) when included as a component of a 
culture medium. The prediction is based on a relationship that is determined 

10 between measured indicia (e.g., quantitative level) of the property observed with 
known compounds. Alternatively, the measured indicia can be a qualitative 
measure (e.g., response/no response). The relationship is used to determine 
those untested compounds that are predicted to give desired indicia of the 
property based on the parameter(s) of the compounds. 

15 The present invention can be employed to identify any type of polymeric 

compound of interest, preferably for use as a component of a culture medium. 
Exemplary classes of compounds include, but are not limited to peptides, 
proteins (including modified proteins, e.g., glycoproteins), lipids, carbohydrates, 
nucleic acids, and the like. Peptides are the preferred compound. 

20 Compound libraries can be made by any method known in the art. 

Individual compounds within the libraries can be isolated and/or synthesized by 
any method known in the art. In particular, peptides can be synthesized by solid 
phase solution phase synthetic methods. For example, the peptides can be 
synthesized by FMOC chemistry (Atherton et al., (1989) Solid Phase Synthesis: 

25 A Practical Approach. IRL Press at Oxford University Press, Oxford, England) on 
an Advanced ChemTech Model 396 synthesizer. Alternatively, peptides may be 
synthesized using other variations of the Merrifield approach (Merrifield, (1965) J. 
Am. Chem. Soc. 85:2149), including Boc chemistry, synthesis on other solid 
supports (e.g., other resins, pins, etc., "tea-bag" synthesis (R. A. Houghten, 

30 (1985) Proc. Nat Acad. Sci. USA 82:5131), and by combinatorial methods (e.g., 
split and divide). Peptides may also be synthesized to include modifications to 
the carboxyl terminus (e.g., esters, amides, etc.), the amino terminus (e.g., acetyl 
groups), and other non-naturally occurring amino acids (e.g., norleucine). 
Methods of oligonucleotide synthesis are also known in the art. See, e.g., 
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Oligonucleotide Synthesis: A Practical Approach, M. J. Gait, ED, IRL Press: 
Washington. D.C., 1984. The generation of carbohydrate libraries is described, 
e.g., in Liang et al., (1996) Science 274:1520. Construction of RNA libraries are 
known in the art, e.g., by SELEX as described by C. Tuerck et aL, (1990) Science 
5 249:505. 

This embodiment of the invention can be used to identify a compound(s) 
(as described above) for use as a component(s) of culture medium (e.g., cell 
culture medium, tissue culture medium, organ culture medium, and the like) 
having any property(ies) of interest. Exemplary properties include, but are not 

10 limited to, altering (e.g., enhancing or increasing, or in contrast, inhibiting or 
suppressing) the growth of cells in culture (e.g., cell division and/or cell size), 
altering production (e.g., at the level of transcription, translation, post-translational 
processing, intracellular transport, secretion, turnover, and the like) of a protein(s) 
and/or peptide(s) (e.g., antigens, toxins, antibodies, hormones, growth factors, 

15 cytokines, clotting factors, enzymes, and the like) by ceils in culture, and altering 
the synthesis, processing and/or secretion of other compounds and/or 
metabolites (e.g., antibiotics, steroids, carbohydrates, lipids, nucleic acids, and 
the like). Additional properties include the ability to alter (as defined above) the 
maturation, differentiation, growth and/or proliferation of cells. 

20 Those skilled in the art will appreciate that a compound may give different 

indicia of the property depending on the particular cell, tissue or organ to be 
cultured. Moreover, the indicia of the property may be affected by the base 
culture medium and/or the presence of other medium components. The culture 
medium can be used to culture any cell, tissue, or organ of interest. Preferably 

25 the culture medium is a cell culture medium. Typically, and preferably, the 
present invention is used to identify a component of culture medium to culture 
cells in vitro. In particular, the present invention can be practiced to culture 
animal (more preferably mammalian, avian or insect), plant, bacterial, protozoan, 
fungal, or yeast cultures. In addition, the culture medium can be one that is used 

30 to package viruses or bacteriophage in host cells. The present invention can also 
be advantageously employed to identify compounds for culture medium for 
cultures of primary cells (e.g., to grow p-islet cells for insulin production). Other 
uses of the present invention are to culture pathogenic organisms for diagnostic 
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purposes, culture cells that have been genetically engineered to express 
recombinant peptides or proteins {e.g., biopharmaceuticals such as interferon, 
tissue plasminogen activator, antibodies; industrial enzymes, in particular, low 
yield industrial enzymes such as restriction enzymes, taq polymerases, 
5 synthetases, and the like), and to culture cells to isolate secondary metabolites 
for use as drugs {e.g., cephalosporins). Finally, the culture medium can be a 
liquid, semi-solid, or solid culture medium. Preferably, the culture medium is a 
liquid medium. 

Indicia of the property may be measured using any suitable method known 

10 in the art. For example, ELISAs or any other immunoassays relying on specific 
binding to an antibody or receptor may be employed. Typically, such methods 
will involve a radiolabeled, fluorescent or other detectable moiety (e.g., a dye or 
intercalator such as acridine orange for DNA). Measurements may also be 
determined using labels that produce signals detectable by spectrophotometry 

15 (including colorimetry and measurement of optical density), x-ray diffraction or 
absorption, magnetism, or enzymatic activity. Chemiluminescence and 
fluorescence lifetime measurements may also be utilized. Suitable labels include 
fluorophores, chromophores, radioactive isotopes, electron-dense reagents, 
enzymes, and ligands having specific binding partners (e.g., biotin-avidin). 

20 Alternatively, a flow-through assay such as those that employ surface plasmon 
resonance detection may be used. 

Cell number and/or size can be readily assessed by methods known in the 
art, e.g., staining and visual observation, turbidity measurements, 
spectrophotometric methods (including colorimetry and measurement of optical 

25 density), counting with an automated cell counter and/or automated plate 
counter, measurement of total cellular DNA and/or protein, impedance of an 
electrical field, bioluminescence, carbon dioxide, oxygen or ATP production or 
consumption, and the like. Proteins and other compounds can be detected 
and/or quantified using standard analytical techniques such as chromatography, 

30 gel separation techniques, and the like. Likewise, methods of detecting nucleic 
acids are well-known in the art and include specific hybridization to probe 
sequences and amplification methods (e.g., polymerase chain reaction, strand 
displacement amplification, etc.). Carbohydrates can be detected by any method 
known in the art, including but not limited to, carbohydrate-specific staining (e.g., 
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lectins or anthrone-based assays), spectrophotometric methods with dyes or 
copper, A205 measurements, or gas-liquid chromatography. 

Any measurement tool known in the art may be used to take 
measurements as described above, e.g., a spectrophotometer for absorption or 
5 colorimetric measurements, a fluorometer or flow cytometer for fluorescence 
measurements, a scintillation or gamma counter for radioactive measurements, 
and an automated cell counter, automated plate counter, or manual plate counter 
for cell number measurements. As a further example, a microwell reader can be 
used for fluorescence, absorbance or colorimetric measurements. 
10 Referring again to FIG. 1, in carrying out this particular embodiment of the 

invention, a first plurality of culture media each containing a first test 
compound(s) is provided. Operations are performed to determine indicia of the 
property of interest for each of the culture media in the first plurality thereof, Block 
102. 

15 The first test compounds are selected from a first test library of 

compounds, preferably using a space-filling design. It is also preferable that the 
first test compounds be representative of the first test library. The term "space- 
filling design" as used herein is intended to be construed broadly and includes all 
such techniques known to those skilled in the art. Exemplary space-filling 

20 designs include but are not limited to full factorial designs, fractional factorial 
designs, maximum diversity libraries, genetic algorithms, coverage designs, 
spread designs, cluster based designs, Latin Hypercube Sampling, and other 
optimal designs (e.g., D-Optimal), and the like. 

A space-filling design assists in selecting experimental design points. 

25 Ideally, all data would be gathered at every possible combination of the 
explanatory variables which may possibly affect the response of interest, in other 
words, fill the entire space. When the candidate space is very large and the 
number of possible values is large, it may not be feasible to enumerate all such 
possible combinations, much less physically gather the data. For example, it 

30 would generally not be feasible to evaluate all possible peptide tetramers or 
pentamers {i.e., 160,000 possible tetramers and 3,200,000 possible pentamers). 
Space-filling designs provide a strategy for gathering data at a set of design 
points, such that the data gathered will efficiently represent all candidate 
compounds, known as the candidate space. When no prior information or 
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knowledge is available, one method of generating a space-filling design is to use 
a geometric distance-based criterion. 

Two general categories of distance-based designs are minimax and 
maximin. Assume that C denotes a finite set of possible design points and that 
there is a distance function d on CxC such that (C, d) is a metric space. Consider 
subsets D of C of size n. D is called a distance-based design if the design criteria 
depends on the distance function d. The minimax criterion attempts to cover the 
experimental space by locating design points so as to minimize the maximum 
distance from any candidate point to the closest design point More specifically, 
call D* a minimax distance design if 

minmaxJ(7,D) = m2xd{y,D^) where: d{y,D) - min^^^ d{y,x) . 

D yeC 

The maximin criterion tries to spread the design points in space so as to 
maximize the minimum distance between the pairs of design points. In particular, 
we call a maximin distance design if 

maxmin^/(x,x') = min d(x^x') = d"" 

Maximin designs can be generated by Gosset (Hardin and Sloane, (1992) 
Operating Manual for Gosset: A General Purpose Program for Constructing 
Experimental Designs (2d ed.), Mathematical Science Research Center, AT&T 
Bell Laboratories, Murray Hill, New Jersey). 

Approximations to these criterion that are more numerically stable and can 
be found using an exchange algorithm, are the "coverage" and "spread" criteria, 
respectively. The maximin, or spread criterion, tends to produce designs with a 
large number of design points at the boundaries of the region or most extreme 
values, while the minimax, or coverage criterion, produces designs with more 
points in the interior of the region. 

A coverage design minimizes the following criterion for a choice of 
parameters p and q: 
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f Y'^ 

c,,{C,D) = \Y,{d^{y,D)rlNc 

where the distance metric is defined as: d^(y,D) = XilF ~ y\\ A 
where p < 0 and q > 0. 



5 Alternatively, as another space-filling design, test libraries can be 

generated using a genetic algorithm. In general, a genetic algorithm is based on 
the model of natural selection. Genetic algorithms optimize structures by 
computationally performing selection, crossover, and mutation in a population of 
structures in a manner analogous to natural selection. A given population of 

10 compounds is encoded as binary structures ("chromosomes"), and their 
opportunity to "reproduce" and be included in succeeding generations is based 
on their biological activities. In the reproduction step, the chromosomes for two 
compounds are crossed at a single point to produce two new "children" 
compounds. Mutation occurs by randomly changing any single bit in the 

15 sequence. The chromosomes are then decoded into compound structures, 
which are then synthesized and tested, and the process is repeated for the next 
generation. 

A typical genetic algorithm runs as follows: 



20 Step 1 . Initialize a population of chromosomes, /.a, compounds (this 

can be done completely at random by a computer, or selected structures can be 
used to "seed" the initial population). 



Step 2 . Evaluate each chromosome in the initial population (e.g., 
25 synthesize and test every peptide in the initial population). 



Step 3 . Create new chromosomes by mating current chromosomes; 
apply mutation and recombination as the parent chromosomes mate (this is done 
by feeding the indicia of the properties of the compounds into the computer, and 
30 the program performs the mutation and recombination process). 
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step 4 . Delete members of the population to make room for the new 
chromosomes (the population will always be fixed at a particular size. The 
program will select which compounds get deleted, which are usually the poorest- 
performing compounds). 

5 

Step 5 . Evaluate the new chromosomes and insert them into the 
population. 

Step 6 . The process can end at this point, with the best 
10 chromosome(s) being selected; alternatively, additional generations can be 
followed by repeating steps 3-5. 

To illustrate, the following example using model data is presented. In 
selecting a peptide library using a genetic algorithm, the chromosomes will be 
15 individual peptides. Each amino acid may be represented as a binary string. For 
a 4-bit string, there are 16 possible combinations (Table 1). If, for example, only 
10 of the possible amino acids are used, 6 of these amino acids must be 
represented twice (e.g., Gly is represented by 1010 and 1011), so that all of the 
16 possible combinations are assigned to an amino acid as follows: 
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Table 1 



Binary String 


Amino Acid 


0000 


Va! 


0001 


Glu 


0010 


Leu 


0011 


Pro 


0100 


Lys 


0101 


Ser 


0110 


Ala 


0111 


Val 


1000 


Phe 


1001 


Glu 


1010 


Gly 


1011 


Gly 


1100 


Ser 


1101 


Phe 


1110 


Gin 


1111 


Pro 



An initial population of tetrapeptides can be generated using a random 
number generator. Structures can be modified at this point because of possible 
synthetic difficulties or to ensure that each amino acid is represented at each 
position, etc. Assume the following set of chromosomes (peptides) are 
generated: 
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Table 2 



SEQ ID NO: 


Peptides 


Binary String 


1 


Gly-Ala-Leu-Gly 


1010011000101010 


2 


GIn-Gly-Val-Glu 


1110101000000001 


3 


Ser-Ala-Pro-Va! 


0101011000110000 


4 


Ser-Pro-Aia-GIn 


0101001101101110 


5 


Glu-Glu-Val-Phe 


0001000100001000 


6 


Vai-Leu-Ser-Lys 


0000001001010100 


7 


Va!-Ser-G!u-Leu 


0000010100010010 


8 


Pro-Phe-Glu-Pro 


0011100000010011 


9 


Glu-Leu-Gin-Glu 


0001001011100001 


10 


Lys-Vai-Gln-Phe 


0100000011101000 


11 


Gly-Lys-AIa-Pro 


1010010001100011 


12 


Aia-Gin-Lys-Ser 


0110111001000101 


13 


Ala-Gln-Gly-Glu 


0110111010100001 


14 


Lys-Giu-Phe-Gly 


0100000110001010 


15 


Pro-Ser-Phe-Lys 


0011010110000100 


16 


Phe-Ser-Leu-Ala 


1000010100100110 


17 


Leu-Phe-Gly-Aia 


0010100010100110 


18 


Glu-Val-Lys-Ser 


0001000001000101 


19 


Val-Gly-Glu-Ala 


0000101000010110 


20 


GIn-Glu-Ser-GIn 


1110000101011110 



5 If, for example, the computer decided to cross Gly-Ala-Leu-Gly (SEQ ID 

NO:1) and Ser-Ala-Pro-Val (SEQ ID NO:3) at their mid-points, it would generate 
two new children chromosomes/peptides to the population to test: Gly-AIa-Pro- 
Val (SEQ ID NO:21) and Ser-Ala-Leu-Gly (SEQ ID NO:22). 

Genetic algorithms are described in more detail, e.g., in J. Sing et al., 
10 (1996) J. Am, Chem. Soc. 118:1669 and Handbook of Genetic Algorithms . 
Lawrence Davis, Ed. , Van Nostrand Reinhold: New York, 1991. 

Referring back to FIG. 1 , the test compounds can be all or a subset of the 
compounds in the first test library. The first test library can be selected on the 
basis of any criterion known in the art. For example, the first test library may 
15 include all possible pentapeptides (naturally occurring and/or non-naturally 
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occurring). Alternatively, the first library may contain all possible penta peptides 
based on a set often possible amino acids. As a further non-limiting example, ail 
of the compounds in the first test library may have a specific subunit designated 
in a particular position(s) (e.g., the first amino acid may always be an alanine or 
5 an aromatic amino acid). It is not necessary that all of the test compounds in the 
first test library actually be synthesized and/or isolated, e.g., the library may be a 
"virtual" library. Typically, it is only necessary that the test compounds for which 
an indicia of the property is determined actually be synthesized, alternatively 
isolated, and evaluated. Alternatively, it is possible that indicia of the property of 

10 a particular compound(s) may be determined from other sources (e.g., the 
literature or from previous studies or studies from other investigators) and, 
therefore, this compound(s) would not have to be synthesized and a 
measurement of the indicia of the property of interest determined. 

It is preferred that the present invention be carried out to screen a peptide 

15 library to identify peptides for use as components of culture medium. There are 
no particular requirements of the peptide library to be used to carry out this 
embodiment of the invention. The peptides in the library may contain naturally- 
occurring and/or synthetic amino acids. The library may also contain modified 
amino acids (e.g., phosphorylated, methylated, glycosylated, and the like). 

20 Moreover, the peptide library can be defined to contain less than all of the 
possible naturally-occurring and/or synthetic amino acids. The peptide library 
may also be defined so that all of the peptides therein have the same length or 
range of lengths. Alternatively, the peptides in the library may vary in length, e.g., 
tetramers, pentamers and/or hexamers. Peptide libraries in which all of the 

25 peptides have the same length (e.g., 4, 5, 6, 7, 8, 9, 10 or more amino acids) are 
preferred. In preferred embodiments, the peptide library contains peptides 
having a length of four, five, six, seven, eight, nine, or ten amino acids, or longer. 
In other preferred embodiments, the peptide library contains peptides having a 
length in a range from about four amino acids to about twenty amino acids, more 

30 preferably, from about four amino acids to about ten amino acids. 

In alternate preferred embodiments, one or more amino acid positions in 
the peptides is fixed (i.e., nonvariable) or limited to specified particular amino 
acid(s) or class(es) of amino acids. For example, in a library of pentapeptides, 
the amino acids at positions 4 and 5 might be fixed as a specific amino acid (e.g.. 
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Ala or Val) or class of amino acids (e.g., aromatics). Likewise, the peptides may 
be 20-mers, but only 5 of the positions may be variable with the other positions 
being fixed. The positions may be fixed based on any criteria, e.g., random 
assignment, prior chemical knowledge, ease of manufacturing and/or synthesis, 
5 cost, and the like. 

Those skilled in the art will appreciate that fixing or limiting the possible 
amino acid(s) at a particular position or positions will reduce the total number of 
possible peptides and may likewise decrease the time, expense and/or technical 
difficulties associated with synthesizing and testing peptides, identifying leads, 

1 0 and follow up (if necessary). 

As a further aspect, the present invention provides a method of screening 
a compound library (preferably a peptide library) in which at least one of the 
amino acid positions is nonvariable or limited to designated subunits (/.e., less 
than ail possible subunits, e.g., amino acids). A compound of interest, e.g., as a 

15 component of a culture medium, may be identified in the first round of screening. 
Alternatively, leads are identified and successive screenings are performed as 
described herein. 

Operations are performed to determine a desired indicia, or alternatively 
range of indicia, of the property to establish a test requirement against which the 

20 measured indicia of the property of the media containing the test compounds are 
compared, Block 106. The test requirement may be determined at any stage in 
the process of identifying a culture medium component. For example, the test 
requirement may be set a priori or, alternatively, it may be determined after the 
indicia of the property of the first plurality of culture media each containing a first 

25 test compound is determined. Moreover, the test requirement may change 
during the compound screening process. 

The test requirement may represent a threshold level and indicia of the 
property falling at or above the test requirement may be desirable (e.g., when 
screening for compounds that increase antibiotic production) or it may represent 

30 a ceiling and values falling below the test requirement may be desirable (e.g., 
when screening for compounds that suppress endotoxin production during 
fermentation processes). As a further alternative, the test requirement may relate 
to a range of desired indicia, i.e., the test requirement may establish both a floor 
and a ceiling for the measured indicia (e.g., to reach a balance between 
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competing factors, such as cell growth and protein production). Those skilled in 
the art will appreciate that the test requirement may represent the optimal indicia 
of the property (e.g., maximal immunogen production), alternatively, the test 
requirement may take into account other criteria such as feasibility, cost, time 
5 constraints, effects on other desired properties of the culture medium, etc. 

As yet a further alternative, the test requirement may be qualitative, rather 
than quantitative, in nature, e.g., if one is looking for the absence/presence of a 
particular response {i.e., a yes/no answer). Those skilled in the art will recognize 
that for computational analysis of qualitative data, the qualitative values will most 

10 likely be converted into quantitative values (e.g., response/no response ^ 1/0) 

Operations are performed to determine a relationship between at least one 
parameter of the first test compounds and the measured indicia of the property 
for each of the first test compounds, Block 104. Preferably, the relationship is a 
mathematical relationship, more preferably a mathematical structure-activity 

15 relationship between the parameter(s) and the property (/,e., activity) of interest. 

There is no particular limit to the number of parameters used to determine the 
relationship, and two, three, four, five, six, seven, eight, nine, ten or more 
parameters can be used. In one illustrative example, a relationship is determined 
between three parameters (molecular weight, hydrophobicity, total charge) of a 

20 plurality of peptides and the measured cell growth or p-toxin production by 
cultures of Clostridium perfringens. 

Any parameter {i.e., descriptor) known in the art that can be applied to 
characterize a compound may be used to carry out the present invention. 
Physical, chemical (including biochemical), biological and/or topological 

25 parameters may be employed to determine the relationship. The term 
"parameter" as used herein is also intended to encompass the principle 
components of S. Hellberg et al., (1987) J. Med. Chem. 30:1126 (e.g., z1, z2, z3). 
The parameter(s) used to describe the test compounds can change in both 
number and type during the selection process. In addition, the parameter(s) can 

30 be a whole molecule parameter(s), sequence specific parameter(s), or a 
combination of both. 

Preferably, the compounds are characterized using at least one whole 
molecule parameter (e.g., one, two, three, four, five, six, seven, eight, nine, ten or 
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more whole molecule parameters). Also preferred are embodiments wherein the 
compounds are characterized using only whole molecule parameters. A "whole 
molecule parameter" is a value that characterizes a molecule irrespective of the 
arrangement of its constitutive atoms. For example, a whole molecule parameter 
5 for a peptide is one that does not depend on the order or sequence of the amino 
acids in the peptide. Describing a molecule using at least one whole molecule 
parameter may facilitate the compound screening process because it reduces 
(i.e., collapses) the size of the compound space and can thereby decrease the 
time, computational difficulty, and cost of screening large compound spaces (as 
10 described in more detail below). 

Conversely, a "sequence-specific" parameter is one that is dependent on 
the specific order or sequence of the constitutive atoms or subunits. Examples of 
particular sequence-specific and whole molecule parameters have been provided 
hereinabove. 

15 Illustrative parameters that may be employed according to the present 

invention include but are not limited to molecular weight, charge, isoelectric point, 
total dipole moment, isotropic surface area, electronic charge index, and 
hydrophobicity (e.g., as exemplified by measurements such as logP, HPLC 
retention times, or other methods of determining hydrophobicity known in the art) 

20 of the whole molecule or individual building block in the molecule (e.gf., peptide, 
amino acid, nucleic acid, sugar unit, etc.). Any suitable topological parameter 
known in the art may be employed, such as those described by L.B. Kier and L.H. 
Hall, Molecular Connectivity in Structure-activity Analysis, Research Studies 
Press, John Wiley & Sons, Letchworth England (1986); M. Johnson et al., 

25 Concepts and Applications of Molecular Similarity, John Wiley & Sons, New York 
(1990); and R. P. Sheridan et al., (1995) J. Chem. Inf. Comput Sci. 35:310. 

Calculations of parameters can be carried out by any method known in the 
art, for example, using a computerized system, e.g., a Silicon Graphics computer 
or a PC. Total charge, molecular weight, and total dipole can be calculated using 

30 Sybyl 6.5 (Tripos). Moriguchi logP (i.e., mIogP, a measure of hydrophobicity) can 
be calculated using a Sybyl Programming Language Script. Literature values of 
electronic charge index and isotropic surface area for amino acids are available, 
see, e.g., E. R. Collantes et al., (1995) J. Med. Chem. 38:2705. A variation of 
electronic charge index can be prepared in an analogous manner using 
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Gasteiger charges supplied by Sybyl instead of CNDO/2 charges used by 
Collantes et al. (Id). Principal connponent descriptors z1, z2, and z3 are provided 
by Hellberg et al., (1987) J, Med, Chem. 30:1126. Calculations of the isoelectric 
point can be carried out using a Sybyl Programming Language Script. 
5 The relationship between the at least one parameter of the test 

compounds and the measured indicia of the property for each of the test 
compounds is used to identify a second plurality of culture media. Each of the 
second culture media contains a second compound(s) from within a second test 
library, where the second plurality of culture media are predicted to give indicia of 

10 the property that satisfy the test requirement, Block 108. Typically, the second 
test compounds will be untested compounds although those skilled in the art will 
appreciate that one or more compounds from the first set of test compounds may 
be included in the second set of test compounds, e.g., as controls. 

In particular embodiments, the second test library includes all compounds 

15 that are predicted to satisfy the test requirement. Alternatively, and preferably, 
the second test library is chosen to include a subset of the total number of 
compounds that satisfy the test requirement. The second set of test compounds 
may include all of the test compounds in the second test library or, alternatively, a 
subset thereof. For example, the second test library may include all peptides 

20 having five amino acids that are predicted to result in antibody production from 
cultured hybridoma cells above a particular level {i.e., the test requirement) when 
added to culture medium. Alternatively, and preferably, the second test 
compounds are selected from, and more preferably are representative of, the 
second test library. Yet more preferably, the second test compounds are 

25 selected from the second test library using a space-filling design, as described 
above. 

In particular embodiments of the invention, indicia of the property of the 
second plurality of culture media are measured, and the indicia compared with 
the test requirement. A lead compound may be identified at this stage that 
30 satisfies the test requirement. Alternatively, the above-described process of FIG. 

1 is carried out in an iterative fashion. A second relationship between at least 
one parameter of the second test compounds and the measured indicia will be 
determined, and a third set of test compounds from a third compound library is 
identified. As a further alternative, if the first test library provides a suitable 
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compound, the screening process can end there without the need to generate a 
second test library or to engage in further compound screening. 

Those skilled in the art will appreciate that the systematic methods 
described hereinabove can be supplemented by knowledge of the chemical 
5 behavior to select a follow-up library of compounds. For example, in screening a 
peptide library, it may become apparent that peptides containing amino acids with 
large aromatic groups exhibit desired indicia of the property. Accordingly, a 
foilow-up library may be chosen that is enriched in such peptides. Alternatively, if 
a desired end product was composed of an abundance of one or more types of 

10 amino acids, peptides containing these amino acids might be selected for 
screening, in particular if it is known that the particular cell line cannot synthesize 
any or sufficient quantities of the amino acid(s). As a further alternative, one may 
choose to make the carboxyl-terminal groups of a peptide as amides or acids 
based on prior knowledge, e.g., these features are known to enhance activity. 

15 Similarly, one might synthesize one library of peptides that all have carboxyl- 
terminal acids and a second library of peptides with carboxyl-terminal amides. If 
one library performs better than the other, one might only use peptides with 
carboxyl-terminal acids or amides for the remainder of the screening iterations. 

In one preferred embodiment of the invention, as diagrammed in FIG. 1, 

20 the relationship may be expressed in the form of yi = f(Xij), where xy denotes a 
parameter, i ranges from 1 to n, where n represents the number of first culture 
media, j ranges from 1 to d, where d represents the number of parameters 
measured, and y\ represents an estimate of the measured first indicia of the 
property, Block 104. The relationship represented by yi = f(Xij) may be a 

25 parametric or non-parametric formula. 

In one particular embodiment, the relationship is a quantitative structure- 
activity relationship (QSAR). This aspect of the invention can be demonstrated 
using an illustrative example, as follows. 

The present invention may be used to identify a peptide to include as a 

30 component of a culture medium. The culture medium may be used to culture 
bacterial cells genetically engineered to produce a heterologous protein of 
interest. Accordingly, it would be desirable to identify a peptide which when 
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included in a culture medium will enhance protein production by bacterial cells 
grown in the culture medium {i.e., to satisfy a test requirement). 

The following discussion is provided to illustrate the algorithm using 
exemplary data. In this example, eight test peptides are selected from a 
5 tetrapeptide library: DKAH, DWPA, ESMH, GVNE, HEDV, ETGS, HYGV, and 
DFGV (SEQ ID NO:23 to SEQ ID NO:30; Table 3). The test peptides may be 
selected from the library by any means known in the art. The values for three 
parameters (molecular weight, total charge, and miogp, i.e., hydrophobicity) may 
be determined for each of the eight peptides. The indicia of the property, in this 
10 example a particular biological activity {i.e., protein production), may be 
determined for the eight peptides as well. The exemplary data are shown in 
Table 3. 



Table 3 



SEQ ID 
NO: 


Peptide 


Hydro- 
phobicity 


IVIol. Wt 


Total 
Charge 


Biol. Act. 


23 


DKAH 


-3.479 


469.499 


0 


15.0 


24 


DWPA 


-1.608 


486.505 


-1 


25.0 


25 


ESMH 


-3.479 


501.535 


-1 


19.3 


26 


GVNE 


-3.421 


416.411 


-1 


14.4 


27 


HEDV 


-4.03 


496.477 


-2 


18.5 


28 


ETGS 


-4.25 


391.357 


-1 


10.2 


29 


HYGV 


-1.278 


474.518 


0 


23.6 


30 


DFGV 


-1.616 


435.457 


-1 


22.0 



15 

Using regression analysis, e.g., with the program S-Plus (Version 3.4 for 
Solaris, Mathsoft, Seattle, Washington), the following equation can be derived to 
20 describe the relationship between the three parameters and the (hypothetical) 
indicia of the property (/* e., biological activities) of the first set of test compounds. 
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y = 3.64 *logP + 0.056*MW- 1 .97*charge + 1 .73 = .999 



(1) 



where y is an estimated indicia of the property, logP is a measure of 
hydrophobicity, and MW\s molecular weight. 
5 If a satisfactory peptide {i.e., satisfies the test requirement) is not identified 

among the first set of test peptides, the screening process will continue. A 
second set of untested peptides can then be selected by any means known in the 
art, and the parameters for the second set of peptides may be calculated. Using 
Equation 1, the predicted activity of a second set of culture media, where each of 

10 the culture media in the set contains one of the second test peptides, can be 

calculated for each culture media in the second set based on the parameters of 
the peptide included therein. For example, a predicted activity of 28.2 was 
derived for a culture medium containing the untested peptide HYPV (SEQ ID 
NO:31; Table 4). This value is higher than any of the biological activities in the 

15 original library, and, thus, this peptide would be a good candidate for synthesis 
and testing. 



Table 4 



Predicted Peptide 


Hydro- 
phobicity 


Mo\. Wt 


Total 
Cliarge 


Predicted 
Biol. Act. 


HYPV 
(SEQ ID NO:31) 


-0.645 


514.583 


0 


28.2 



If the test requirement is for protein production at a level of at least 25, 
then the compound screening process may end with the identification of HYPV 
(SEQ ID NO:31) (assuming the actual biological activity is equal to the predicted 
activity). Alternatively, if the test requirement is set for protein production of at 
25 least 30, then the screening process would continue. The actual indicia of the 
property of a second set of culture media, each containing one of the second test 
peptides, may be determined. From these measurements, a new relationship 
between at least one parameter and biological activity is calculated. From this 
updated equation, a third set of peptides, which when included in culture media 
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are predicted to promote protein production by the bacteria at a level of 30 or 
greater are identified. Typically, this process can continue in an iterative fashion 
until a peptide having the desired biological activity is identified. 

Likewise, if the test requirement was set at a level of at least 20, then three 
5 of the original test peptides would satisfy the test requirement {e.g., DWPA, 
HYGV, and DFGV; SEQ ID NO:24, SEQ ID NO:29, and SEQ ID NO:30, 
respectively; Table 3), and the compound screening process could stop at this 
point or could continue to look for even better performing peptides. 

Referring to FIG. 2, preferred operations for determining the relationship 

10 between the measured indicia of the property of the plurality of first culture media 
each containing a respective test compound and the parameter(s) of the test 
compounds can be illustrated by a graphical representation. The calculated 
values of the d parameter(s) are plotted, and the measured values of the indicia 
of the property for the n culture media are plotted against the parameter values in 

15 d +^ dimensional space. For ease of illustration only, in FIG. 2, /7=10 culture 
media and d=1 parameter. 

Conventional line-fitting algorithms can be used to generate a "best fit" line 
for the plotted data. For example, regression analysis can be utilized to 
determine a mathematical relationship between the indicia of the property and the 

20 value of the parameter for the test compound in each culture medium. The 
relationship can be represented as yi = f(Xij), where xy denotes a parameter, / 
ranges from 1 to n where n represents the number of first culture media, ; ranges 
from 1 to cf where of represents the number of parameters, and y i represents an 
estimate of the measured first indicia of the property of the first culture media. 

25 The relationship can be used to identify a second plurality of culture media 

each containing a respective second test compound which is predicted to provide 
indicia of the measured property that satisfies the test requirement. In FIG. 2, the 
test requirement has been established to select for compounds that provide 
indicia of greater than 20 units. The equation y j = f(Xij) can be employed to 

30 identify those compounds as components of culture medium that will provide 
indicia of the property that lie on the upper right portion of the line of FIG. 2 (/,e., 
provide indicia of the property of greater than 20 units). 
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Alternatively, a distance function can be calculated to identify compounds, 
which when added to culture media, are predicted to provide indicia of a property 
of interest that satisfies a test requirement. In general, the compounds are 
identified based on their proximity in parameter space to known lead compounds. 
5 According to a preferred embodiment, shown in FIG. 3, the present invention is 
used to identify culture medium components based on the parameters of the 
culture medium components, Block 300. Operations are performed to measure 
first indicia of a property of interest for a first plurality of culture media which each 
contains a first test compound chosen from within a first test library based on a 

10 space-filling design, and a test requirement relating to the measured first indicia 
is determined, Blocks 302 and 304. Operations to carry out Blocks 300, 302 and 
304 are as described above for Blocks 100, 102 and 106, respectively. 

The distance function can be expressed as d(xi, X2) between a first value 
of a parameter, xi, of a first test compound and a second value of the same 

15 parameter, X2, of a second untested compound. Block 306. This relationship will 
assign to culture media containing a second untested compound an estimated 
indicia of the property that corresponds to the measured indicia determined from 
a culture medium containing a first tested compound from the first test library if 
d{xi, X2) < dcutoffi, where dcutoffi is a cutoff distance for the first test compound, 

20 Block 308. In other words, once a lead compound is identified from the first test 
library, additional lead compounds can be determined based on an assumption 
that compounds that are close in parameter space will exhibit similar or better 
activities, Block 310. 

In particular embodiments of the invention, Xi and X2 represent a single 

25 parameter or, alternatively, a set of parameters, i.e., Xi = Xn, X12, X13, Xu - • - Xik 
and X2 =X2i, X22, X23, X24 . . . X2k, where k > 1. One specific example of a method 
of determining a relationship based on distance in parameter space is "nearest 
neighbor" analysis. Other non-limiting and illustrative methods are cluster 
analysis, self-organizing maps, and machine learning approaches. See 

30 generally, B. B. Ripley Pattern Recognition and Neural Networks, Cambridge 
University Press, New York (1996). Typically, in this type of analysis, the 
parameters are established a priori, rather than by determining which parameters 
to evaluate based on a relationship, as described above. 
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One particular, and preferred, method of identifying compounds based on 
a distance function is nearest neighbor analysis. This method can be illustrated 
using the following simplified example with model data. 

According to this illustrative example, the present invention can be used to 
5 identify a peptide as a culture medium component, e.g., for mammalian 
hybridoma cells producing and secreting antibodies. In particular, it is desirable 
to identify peptides, which when added to culture medium, will promote antibody 
production by hybridoma cells at a level greater than a test requirement. Four 
test peptides (DKAH, DWPA, ESMH, GVNE; SEQ ID NO:23 to SEQ ID NO:26, 

10 /.e., a training set) may be selected from a peptide library as described above 
with respect to the QSAR example. Values to describe the various parameters of 
the peptides, for example, hydrophobicity (/.e., miogp), molecular weight, and 
total charge may be calculated for each peptide (Table 5). Each peptide may be 
added to hybridoma culture medium and antibody production {i.e., biological 

15 activity) may be measured for the cells cultured in each culture medium (values 
shown in Table 5). 



Table 5 



SEQ 
ID NO: 


Peptide 


Hydro- 
phobicity 


Mol. Wt 


Total 
Charge 


Biol. Act. 


23 


DKAH 


-3.479 


469.499 


0 


15.0 


24 


DWPA 


-1.608 


486.505 


-1 


25.0 


25 


ESMH 


-3.479 


501.535 


-1 


19.3 


26 


GVNE 


-3.421 


416.411 


-1 


14.4 



20 

Assume that there is a second set of untested {i.e., candidate) peptides as 
shown In Table 6. 
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Table 6 



SEQ 
ID NO- 


Peptide 


Hydro- 
nhnbicitv 


Mol. Wt 


Total 


Biol. Act. 


27 


HFDV 




4Qfi 477 




7 


28 


ETGS 


-4.25 


391 357 


-1 


7 


29 


HYGV 


-1.278 


474.518 


0 


? 


30 


DFGV 


-1.616 


435.457 


-1 


7 



The idea of the nearest neighbor rule is to find candidate peptides with parameters 
5 that are similar to those from the peptide(s) with the "best" (in this case highest) 
observed biological activity or the lead peptide(s). Before performing any 
calculations, typically all parameters will be standardized so that they will each 
have an equal contribution to the nearest neighbor calculation. In this illustrative 
example, all parameters may be standardized so that they lie between the values 
10 of 0 and 1. This standardization ensures that all parameters will have an equal 
contribution to the nearest neighbor calculation. A standardized value may be 
computed in the following manner: 

Standardized value = (Original value - Min. value)/ (Max. value - Min. value) (2) 

15 

For example the standardized value of molecular weight for the peptide 
DKAH (SEQ ID NO:23) may be calculated as follows: 

(469-499 - 391 .357)7(501 .535 - 391 .357) = 0.7092 (3) 

20 

The standardized parameter values for the eight peptides are displayed 
below in Table 7. 
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Table 7 



SEQID 
NO: 


Peptide 


Hydrophobicity 


Molecular 
Weight 


Total 
Charge 


23 


DKAH 


0.2594 


0.7092 


1 


24 


DWPA 


0.889 


0.8636 


0.5 


25 


ESMH 


0.2594 


1 


0.5 


26 


GVNE 


0.2789 


0.2274 


0.5 


27 


HEDV 


0.074 


0.9541 


0 


28 


ETGS 


0 


0 


0.5 


29 


HYGV 


1 


0.7548 


1 


30 


DFGV 


0.8863 


0.4003 


0.5 



Once the standardized values have been calculated, nearest neighbors 
5 may be determined by calculating the Euclidean distances between the peptides 
in this 3-dimensional space (where 3 represents the number of parameters). For 
example, the distance between DKAH (SEQ ID NO:23) and HYGV (SEQ ID 
NO:29) is calculated as: 

10 SQRT( (.2594 - if + (.7092 - .7548^ + (1-1)^) = .7420 (4) 

Table 8 contains these calculated distances between the training set of 
peptides and the candidate set of peptides, 

15 Tables 







HEDV 


ETGS 


HYGV 


DFGV 


SEQ ID NO:23 


DKAH 


1.0461 


.9057 


.7420 


.8593 


SEQ ID NO:24 


DWPA 


.9604 


1.2394 


.5236 


.4633 


SEQ ID NO:25 


ESMH 


.5362 


1.0331 


.9266 


.8675 


SEQ ID NO:26 


GVNE 


.9056 


.3599 


1.0238 


.6315 
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The peptides in the candidate set will then be assigned predicted indicia of 
the property based the closest peptide in the training set (Table 9). The 
(hypothetical) biological activities for these four peptides may then be measured 
as shown in Table 9. 

5 

Table 9 



Candidate Peptide 


Closest 
Peptide 


Predicted 
Biol. Activity 


Observed 
Activity 


HEDV 

(SEQ ID NO:27) 


ESMH 

(SEQ ID NO:25) 


19.3 


18.5 


ETGS 

(SEQ ID NO:28) 


GVNE 

(SEQ ID NO:26) 


14.4 


10.2 


HYGV 

(SEQ ID NO:29) 


DWPA 

(SEQ ID NO:24) 


25.0 


23.6 


DFGV 

(SEQ ID NO:30) 


DWPA 

(SEQ ID NO:24) 


25.0 


22.0 



The test rule is to test candidate peptides that are similar to the best 
10 members from the first test library. Thus, in this example, HYGV (SEQ ID NO:29) 
and DFGV (SEQ ID NO:30) may be synthesized and tested. If either or both of 
the peptides satisfy the test requirement, the compound screening process may 
be stopped at this point. Alternatively, if a compound has not yet been identified, 
or if additional compounds are desired, the process can be continued in an 
15 iterative fashion. As a further alternative, the selection and screening process 
can be continued using a different relationship, e.g., a QSAR relationship as 
described above. Finally, as described above, if the first screening yields a 
suitable compound, it may not be necessary to engage in successive rounds of 
picking a library and screening additional test compounds. 
20 Referring to FIG, 4A and 4B, the process of identifying a peptide as a 

component of culture medium using nearest neighbor analysis is graphically 
represented. After the actual indicia of the property have been measured, the 
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indicia (y-axis) for each peptide (x-axis) may be plotted in ascending (or 
conversely, in descending) order, FIG, 4A. Those compounds that satisfy the 
test requirement (in this case, compounds that provide indicia of the property of 
greater than 10 units when added to culture medium) are selected as lead 
5 compounds and the parameter space surrounding some or all of these leads is 
explored further. 

FIG. 4B demonstrates nearest neighbor analysis of a particular lead 
peptide. For illustrative purposes only, two parameters (e.g., total dipole and 
hydrophobicity) are employed for the analysis. The standardized values (as 

10 described above) for the two parameters are plotted on the x- and y-axis. 

Concentric circles can be drawn through the parameter space to represent a 
particular cut-off in Euclidean distance from the lead peptide. In one particular 
embodiment, a space-filling design is used to find points in parameter space as 
indicated by the x's and test the peptides (circles) closest to these points. The 

15 reason for extending the space around the lead peptide (/.e., concentric circles) is 
to gather information as to how close peptides must be in parameter space to 
exhibit similar activities, characteristics, or indicia of the property(ies) of interest. 

The cut-off distance will generally be established for each lead compound. 
Typically, if the data measured on the first plurality of culture media are clustered 

20 together, the cut-off distance will be relatively smaller than if the data points are 
spread out. Once the cut-off distance has been determined, then a second 
library of second test compounds that fall within the cut-off space can be 
identified. The second test compounds are predicted to provide indicia of the 
property that are similar to or better than the closest lead compound. All or a 

25 subset of the second test compounds in the second test library are added to a 
second plurality of culture media and the actual indicia of the property are 
measured. For example, a space-filling design can be used to select less than all 
of the second test library for screening. From this second data set, a final 
compound for identifying a culture medium component may be identified. 

30 Alternatively, a second set of lead compounds can be determined, and nearest 
neighbor analysis (or some other relationship, e.g., as described by FIG. 1) can 
be used to identify a third set of compounds for screening. The screening 
process can continue as many times as necessary to identify compounds 
exhibiting acceptable or suitable indicia of the property(ies). 
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It will be appreciated by those skilled in the art that additional operations 
can be performed to further optimize the above-described methods for identifying 
a culture medium component. For example, at any stage in the screening 
process, the number or types of parameters can be changed. In particular, in 
5 preferred embodiments, redundant parameters will be identified and eliminated. 
It is advantageous to identify those parameters that enhance the discrimination 
among compounds and eliminate redundant parameters to minimize computation 
complexity and time and to reduce computer storage space. Redundancy of 
parameters can be determined by any method known in the art, e.g., Principal 

10 Component Analysis. Typically, redundant parameters will be highly correlated 
with an already-existing parameter. 

As a further example of process optimization, a cocktail of compounds for 
use in formulating culture medium can be identified. A cocktail of compounds 
may give improved results over a single compound alone {e.g., synergy). In 

15 addition, once a compound(s) for addition to the culture medium has been 
identified, the base formulation may be reformulated to further improve the final 
medium. 

Alternatively, reformulation of the base medium may occur at any point in 
the screening process. Those skilled in the art will appreciate that in some 

20 situations it will be advantageous to formulate the base medium so that relatively 
modest changes in the property of interest can be detected as the effects of lead 
compounds identified at the early stages of the screening process (which may be 
relatively small) are masked by some base medium formulations. In other words, 
a less than optimum base medium may intentionally be selected, at least at the 

25 initial stages of the identification process, to maximize the observed impact of the 
test compounds on cell performance. 

A common reason for reformulating media is to move to a defined or at 
last a semi-defined media, which will typically reduce the variance in performance 
observed from one lot to the next. The more defined recipes are generally more 

30 complex and thereby are labor intensive to prepare. In preferred embodiments of 
the invention, the addition of the compound(s) identified by the inventive methods 
will permit the omission or, alternatively, the reduction of one or more medium 
components. The omitted or reduced component may be an undefined 
component, e,gr., serum or a protein hydrolysate. Alternatively, one or more 
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defined media components may be reduced or omitted, e.g., an amino acid, 
vitamin, mineral, carbohydrate source, lipid source, and the like. It is also 
desirable to remove or reduce the presence of certain medium components to 
minimize production costs and to simplify production processes and quality 
5 control. Typically, this process would occur in the final stages of media 
optimization. Components of the base culture medium would be removed, e.g., 
one or more components at a time, and the property of interest assayed. An 
experimental design such as a fractional factorial design may be used to assess 
the contribution of the particular component(s) to the overall performance of the 

10 medium. Those components that have no significant effect on the property of 
interest can be removed or reduced in the final culture medium formulation. 
Alternatively, it may be decided that a component does have an effect on the 
property of interest, but it is omitted or reduced in the formulation because of 
other considerations, e.g., cost, contaminants, and the like, /e., the advantages 

15 are outweighed by the disadvantages of maintaining the component at its current 
level. 

In the screening process, it may further be preferred to screen compounds 
in both a chemically-undefined culture medium and a more chemically-defined 
culture medium. This parallel screening need not be carried out concurrently. 

20 During the process of reformulating the medium used to culture a particular cell, 
tissue, organ, etc., it may be preferable to avoid changes to the cellular 
population in response to the new medium (/.e., subcloning the population). For 
example, in the process of moving cultured cells from a complex culture medium 
containing protein hydrolysates to a more defined medium in which peptides are 

25 substituted for the protein hydrolysate, changes may be seen in the population of 
cultured cells. Those skilled in the art will appreciate that it may be desirable that 
the cultured cells retain the ability to grow in the complex medium (with or without 
the compound). The parallel screening process described above will assist in 
maintaining the characteristics of cells cultured therein. In addition, the 

30 maintenance of cell viability and growth in undefined and defined media provides 
an opportunity to identify compounds that enhance performance in both types of 
media. 

The terms "chemically defined" and "chemically undefined" culture media 
are used herein according to their commonly accepted meanings in the art. In 
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general, a "chemically defined" culture medium is a medium formulation in which 
essentially all of the components therein are known and are present in known 
concentrations. Alternatively stated, a "chemically defined" culture medium is 
one in which essentially all of the components can be described in terms of their 
5 chemical formulas and are present in known concentrations. A "chemically 
undefined" culture medium is one in which the identity and/or concentration of 
some medium component is unknown and only proportionate values such as total 
amino nitrogen are obtainable. Thus, any medium containing an undefined 
component also becomes undefined. The term "semi-defined" is typically used in 

10 the art to describe a medium that contains only a small amount of undefined 
material. Examples of undefined components commonly used in media are yeast 
extract and fetal calf serum. 

In addition, the cells may be "conditioned" or "adapted" prior to the 
compound screening process by cycling the cells at least once through their 

15 current growth medium and the base medium that will be used for the screening 
process. Typically, the current growth medium is an undefined or semi-defined 
medium, while the base medium for screening is chemically defined. This 
conditioning/adaptation process will increase the likelihood that cells will grow in 
both their former medium as well as in the new base medium. 

20 Conditioning/adapting a cell line can enhance the reproducibility of the growth 
assay and thereby increase assay resolution. In addition, using cells that can 
grow in both chemically defined and undefined media provides the opportunity to 
identify medium components for types of media. 

As a further optional step, the cells may undergo one or two periods of 

25 incubation in base medium alone {i.e., lacking the test compound(s)) prior to 
being exposed to the test compound(s). Passing the cells through one or more 
incubation periods in base medium has a two-fold effect. First, it prevents carry- 
over of undefined components from the previous culture medium, which carry- 
over may skew the screening results. Second, the incubation period allows the 

30 cell to adapt to the new basal medium, so that the measured results should be 
reflective of the individual test compounds. 

It is not necessary that the inventive methods described herein result in a 
fully-defined culture medium. The final medium formulation may contain serum, 
protein hydrolysate, or other undefined medium components. 
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Moreover, the inventive screening methods described herein may be 
carried out with a base medium from which undefined protein components are 
absent. Alternatively, the base medium may contain an undefined protein 
component(s), as long as the effects of lead compounds are not masked by the 
5 presence of the undefined protein component. In one particular embodiment in 
which a medium for culturing bacteria is being formulated, the base medium 
contains less than about 10% (w/v) (e.g., from about 0.0001% to about 10% 
(w/v)), preferably about 0.1% (w/v) to about 2.5% (w/v), more preferably about 
0.25% (w/v) to about 1% (w/v), of an undefined protein component. In another 

10 particular embodiment in which a medium for culturing animal (e.g., mammalian) 
cells is being formulated, the base medium used in the screening process 
contains serum. Preferably the serum concentration is from less than about 30% 
(v/v), more preferably less than about 20%(v/v), in the base culture medium. 
Alternatively, the serum concentration is preferably from about 0.05% to about 

15 30% (v/v), more preferably 1% (v/v) to about 30% (v/v), still more preferably 
about 5%(v/v) to about 20% (v/v) in the base culture medium. 

Exemplary undefined protein components include but are not limited to 
hydrolysates (e.g., produced by chemical cleavage such as a casamino acid), 
digests (e.g., enzymatic digests, such as tryptone, proteose peptone, and the 

20 like), extracts (e.g., yeast extract) and infusions (e.g., organ or tissue infusions, 
such as brain-heart infusions), as those terms are understood in the art. The 
starting material for the undefined protein component is typically yeast, 
slaughterhouse offal, milk proteins or other proteins (e.g., gelatin), tissues, or 
organs. Exemplary sources of sera include but are not limited to fetal calf serum, 

25 horse serum, and the like. 

Another preferred embodiment of the present invention is an apparatus for 
identifying a compound(s) as a component for culture medium based on at least 
one parameter(s) of the compound(s). A preferred apparatus 500 is shown in 
FIG, 5. This preferred apparatus comprises means 502, such as the afore- 

30 mentioned measurement tool, for measuring indicia of the property of interest 
from a plurality of culture media each containing a test compound(s). As 
described above, any measurement tool known in the art may be used to take 
measurements as described above, e.g., a spectrophotometer for absorption or 
colorimetric measurements, a fluorometer or flow cytometer for fluorescence 
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measurements, a scintillation or gamma counter for radioactive measurements, 
and an automated cell counter, automated plate counter, or manual plate counter 
for cell number measurements. As a further example, a microwell reader can be 
used for fluorescence, absorbance or colorimetric measurements. In the 
5 preferred apparatus 500, the measurement tool is a spectrophotometer, more 
preferably, a microwell reader for measuring optical density. 

The apparatus 500 also operates under computer control. In particular, 
the measurement tool 502 is preferably operatively coupled to a general purpose 
or application specific computer controller 504. The controller 504 preferably 

10 comprises a computer program product(s) for controlling operation of the 
measurement tool 502 and performing numerical operations relating to the 
above-described steps. The controller 504 may accept set-up and other related 
data via a file 506, disk input 508, or data bus 510. A display 512 and printer 514 
are also preferably provided to visually display the operations performed by the 

15 controller 504. 

It will be understood by those having skill in the art that the functions 
performed by the controller 504 may be realized in whole or in part as software 
modules running on a general purpose computer system. Alternatively, a 
dedicated stand-alone system with application-specific integrated circuits for 

20 performing the above-described functions and operations may be provided. In 
particular, a preferred computer program product will comprise a computer 
readable storage medium having computer-readable program code means 
embodied in the medium. The preferred computer-readable program code 
means comprises computer-readable program code means for performing the 

25 operations described with respect to FIGS- 1 and 3 and throughout the present 
description. 

Turning to Fig, 6, another aspect of the present invention is a method of 
defining a compound library based on a whole molecule descriptor(s) of the test 
compounds, Block 600. The compound library is preferably representative of the 
30 larger compound space from which it is derived. Thus, this embodiment of the 
invention provides a method of exploring a relatively large compound space more 
efficiently and may furthermore obviate computational, time, cost or other 
restraints. This aspect of the invention can be used with any compound space, in 
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particular, peptide, protein, carbohydrate, nucleic acid, and lipid (e.g., free fatty 
acids, triglycerols, steroids) compound spaces. Peptide spaces are preferred. 

According to the present invention, operations are performed to reduce 
(/.e., contract) the compound space by classifying all test compounds in the 
5 space as a single candidate compound based on a shared global parameter(s) 
(/.e., a whole molecule parameter), Block 602, The group of compounds sharing 
common global characteristics are termed "compound isomers". For example, all 
peptides with the same amino acid composition {i.e., have the same molecular 
weight) can be grouped as a respective candidate compound. As a further 

10 alternative, all peptides having the same chemical formula can be grouped as a 
single candidate compound (will include peptides with different amino acid 
sequences, e.g., SWW and GIILS, C23H43N5O7). 

In the case of a peptide space, the present investigations have found that 
a number of parameters may be determined that are independent of peptide 

15 sequence. These include, but are not limited to, total charge, molecular weight, 
logP (/.e., hydrophobicity), and total dipole. In such instances, the order of the 
amino acids in the sequence is irrelevant, and multiple peptide sequences may 
be represented by a common sequence. For example, the three peptides AKA, 
AAK, and KAA contain the same amino acids in the same relative proportions. 

20 Consequently, all have the same molecular weight, total charge, etc., even 
though their sequences are different. Therefore, all three peptides may be 
represented by a single sequence, such as AAK. 

Thus, while AKA, AAK, and KAA are unique structures, their "global" 
properties may be sufficient for modeling activity. Consequently, all three 
25 sequences would yield the same result (e.g., biological activity). Only one 
peptide may be needed to provide the desired results, and two would be 
redundant. As a result, fewer peptide sequences would be needed to describe 
the property space of all peptides of a given length. 

In one particular embodiment, as shown in FIG. 6, operations are 
30 performed to select less than all of the candidate compounds based on a space- 
filling design and to re-expand the selected candidate compounds into their 
constitutive compound isomers, Block 604. Operations to re-expand the 
compounds function to re-introduce isomer-specific parameters {i.e., sequence- 
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specific parameters). Operations are performed to select the test library from the 
re-expanded set of compounds using any method known in the art, Block 606. In 
particular embodiments, at least one (preferably one) compound is selected from 
each of the expanded groups of compound isomers. 
5 Preferably, the selected test library is representative of the larger 

compound space, and, likewise, it is preferred that the results achieved by 
evaluation of the test library are representative of those that would be obtained if 
the entire compound space was evaluated. Compound libraries generated 
according to this method can be used for any purpose known in the art, e.g., drug 

10 design, media formulation, and the like. 

The foregoing aspect of the invention can be illustrated using the following 
simplified example with model data. The present invention may be used to 
evaluate a peptide space containing all possible peptide tetramers containing the 
amino acids A and/or C. There are sixteen possible tetramers if only these two 

15 amino acids are utilized (Table 10, column 1). The peptide space containing 
these sixteen compounds may be contracted by grouping those peptides sharing 
the same chemical formula {i.e., termed "compound isomers") as five candidate 
compounds (Table 10, column 2). All peptides with this same whole molecule 
characteristic (e.g., chemical formula) are treated as a single peptide (i.e., 

20 candidate compound) with the same properties. Using any method known in the 
art (e.g., a space-filling design), two of the five candidate compounds may be 
selected (Table 10, column 3). The two selected candidate compounds may 
then be re-expanded into the ten individual compound isomers based on their 
sequence (Table 10, column 4). In the final step, one of the individual 

25 compound isomers is selected from each group (for a total of two peptides) to 
form the peptide library (Table 10, column 5). 



30 
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Table 10 



SEQ ID NO:32 


AAAA ^ 


AAAA 




















SEQ ID NO:33 


AAAC ^ 


AAAC ^ 


AAAC ^ 


AACA 




SEQ ID NO:34 


AACA 






AACA 




SEQ ID NO:35 


ACAA 






ACAA -> 


ACAA 


SEQ ID NO:36 


CAAA 






CAAA 
















SEQ ID NO:37 


AACC ^ 


AACC ^ 


AACC ^ 


CCAA 




SEQ ID NO:38 


ACAC 






ACAC 




SEQ ID NO;39 


CAAC 






CAAC ^ 


CAAC 


SEQ ID NO:40 


ACCA 






ACCA 




SEQ ID N0:41 


CACA 






CACA 




SEQ ID NO:42 


CCAA 






CCAA 
















SEQ ID NO:43 


ACCC 


ACCC 








SEQ ID NO:44 


CACC 










SEQ ID NO:45 


CCAC 










SEQ ID NO:46 


CCCA 






















SEQ ID NO:47 


cccc 


CCCC 









5 In this simplified example, the number of peptides has been reduced from 

sixteen to two. However, in a more realistic and complex set of circumstances the 
reduction in the compound space will be significantly greater. Table 11 
demonstrates the number of unique peptide sequences possible with increasing 
peptide length. There is an exponential increase in the number of different peptide 
10 sequences possible from the twenty naturally-occurring amino acids with an 
increase in the size of the peptide. Given X amino acids and a total of Y residues in 
a peptide sequence, the total number of possible sequences can be calculated as 
X^. The total number of peptide sequences for dipeptides through heptapeptides 
using 20 amino acids are listed in Table 11. 

15 
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Table 11 



NumDer or 


1 OTal IMUmDer 


Residues 


of Peptides 


2 


400 


3 


8000 


4 


160,000 


5 


3,200,000 


6 


64,000,000 


7 


1,280,000,000 



5 

For example, 8000 different peptide trimers {i.e., sequences of three 
amino acids) are possible containing the twenty amino acids. This number 
increases to 64,000,000 for hexapeptides and 1,280,000,000 for heptapeptides. 
Accordingly, it can be seen that with increasing complexity in compound size and 

10 composition, the total compound space can become too vast for efficient 
exploration thereof. 

Given limitations in computational power and disk space, the total number 
of possible pentamer and longer peptides may be impractical for calculation of 
sequence-dependent physical parameters. This is significant when the property 

15 of interest is dependent on the sequence of the amino acids in the peptide. In 
such a case, the peptide may be viewed as a series of ordered building blocks. 
This is the common approach in peptide drug design, where the peptide fits into a 
highly defined receptor. 

Table 12 demonstrates the reduction in the compound space achieved by 
20 grouping peptides having a common chemical formula into compound isomers. 
Although this table was derived empirically, later examination revealed it to be 
Pascal's triangle (Kotz and Johnson, Eds-in-Chief (1985) Encyclopedia of 
Statistical Sciences, Vol. 6, pp. 628-630, John Wiley & Sons, New York). As one 
example, the number of peptide heptamers possible from different combinations 
25 of the twenty amino acids is reduced to 657,800 by grouping peptides into 
isomers, as compared with 1,280,000,000 (see Table 11) without grouping of 
peptide isomers. 
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Table 12 



ResiduGs 


1 


2 


3 


4 


5 


6 


7 


1 


1 


1 


1 


1 


1 


1 


1 


2 


2 


3 


4 


5 


6 


7 


8 


3 


3 


6 


10 


15 


21 


28 


36 


4 


4 


10 


20 


35 


56 


84 


120 


5 


5 


15 


35 


70 


126 


210 


330 


6 


6 


21 


56 


126 


252 


462 


792 


7 


7 


28 


84 


210 


462 


924 


1716 


8 


8 


36 


120 


330 


792 


1716 


3432 


9 


9 


45 


165 


495 


1287 


3003 


6435 


10 


10 


55 


220 


715 


2002 


5005 


11440 


11 


11 


66 


286 


1001 


3003 


8008 


19448 


12 


12 


78 


364 


1365 


4368 


12376 


31824 


13 


13 


91 


455 


1820 


6188 


18564 


50388 


14 


14 


105 


560 


2380 


8568 


27132 


77520 


15 


15 


120 


680 


3060 


11628 


38760 


116280 


16 


16 


136 


816 


3876 


15504 


54264 


170544 


17 


17 


153 


969 


4845 


20349 


74613 


245157 


18 


18 


171 


1140 


5985 


26334 


100947 


346104 


19 


19 


190 


1330 


7315 


33649 


134596 


480700 


20 


20 


210 


1540 


8855 


42504 


177100 


657800 



5 Turning to FIG. 7, the foregoing method of selecting a compound library 

can be used to identify a compound as a culture media component based on the 
parameters of the compound as described above, Block 700. A decision is made 
as to whether all compounds in the compound space need to be screened, Block 
702. If it is determined that less than all of the compounds will be tested, 

10 operations are performed to reduce the compound space by representing each 
group of compound isomers within the compound space as a respective 
candidate compound, Block 704, and to then select less than all of the candidate 
compounds, Block 706. The decision to test less than all of the compounds in 
the compound space may be based on any criterion or set of criteria, e.g., cost 

15 considerations, time considerations, computational limitations, the nature of the 
particular property of the compounds being evaluated, etc. 

The selected candidate compounds can be screened directly if sequence- 
specific parameters of the compounds are not deemed important to the selection 
process. Thus, a decision must be made as to whether sequence-specific 

20 parameters should be considered in the screening strategy, Block 708. If 
sequence-specific parameters are to be considered, then the compounds are 
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expanded into the constituent compound isomers by re-introduction of sequence- 
specific parameters, Block 710. At this point, all of the expanded compound 
isomers can be screened, or, alternatively less than all of the expanded 
compound isomers are selected, Blocks 712 and 714. Again, the decision to test 
5 all or less than all of the expanded compound isomers can be made on any 
basis. Finally, at whatever point in the decision process described above it is 
decided that a suitable set of compounds for screening has been identified, 
indicia of a selected property(s) is measured for a plurality of culture media, each 
containing a respective test compound, Block 716. Operations to identify a 

10 compound for use as a culture medium component can be carried out as 
described above, e.g., in FIG. 1 and FIG. 3. 

Turning to FIG. 8, as another preferred embodiment, the invention 
provides a method of predicting the activity (e.g., biological activity) of a peptide 
based on at least one whole molecule parameter of the peptide (as defined 

15 above), Block 800. Those skilled in the art will appreciate sequence-specific 
parameters may also be considered in addition to the at least one whole 
molecule parameter. As used herein, a "biological activity" includes 
pharmacological and biochemical activities. This method can be used to identify 
and/or design peptides with particular activities for use as therapeutic drugs (e.g., 

20 for medical or veterinary uses), in developing culture media components (as 
described in more detail above), for identifying and/or designing peptides that 
interact with a target molecule (e.g., receptor agonists or antagonists) or cell, and 
in identifying and/or designing peptides that induce or enhance, alternatively, 
prevent or inhibit, any activity of a target protein (e.g., a receptor, enzyme, 

25 signaling protein, cell-surface protein, nucleoprotein, ribosomal protein, and the 
like), cell, or nucleic acid (e.g., DNA, rRNA, mRNA, tRNA). 

Accordingly, it will be apparent to one of skill in the art that this 
embodiment of the invention can be performed with cultured cells, tissues or 
organs. Alternatively, this embodiment can also be carried out in cell-free 

30 systems (e.g., lysed cells, cell fractions, or biochemically-defined systems such 
as purified enzymes or receptors). 

According to this embodiment of the invention, indicia of a biological 
activity of interest of a plurality of test peptides from a first test library are 
measured. Block 802., Peptide libraries are as described hereinabove. The 
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peptides are chosen from the first test library based on a space-filling design, as 
described in more detail hereinabove. Moreover, the first test peptide may be 
identified by first reducing a larger peptide space by grouping ail of the peptides 
therein according to at least one whole molecule parameter, e.g., as described 
5 above and in FIG. 6. 

Indicia of the biological activity may be measured using any suitable 
method known in the art, as discussed hereinabove with respect to Block 102. 

A relationship (e.g., a mathematical relationship) is determined between 
the measured indicia of the biological activity and at least one whole molecule 

10 parameter (e.g., descriptor) of the plurality of first test peptides, Block 804. 
Operations to determine the relationship are carried out by any means known in 
the art, preferably, as described above in Blocks 104 and 306. In determining the 
relationship, sequence-specific parameters {i.e., not whole molecule parameters) 
may also be considered. Any whole molecule or sequence-specific parameter 

15 known in the art for describing peptides may be used to determine the 
relationship. Whole molecule parameters include, but are not limited to, total 
charge, molecular weight, isoelectric point, total dipole moment, isotropic surface 
area, electronic charge, and hydrophobicity. In particular embodiments, at least 
two whole molecule parameters are employed to describe each of the test 

20 peptides. In an alternate preferred embodiment, the peptides are described 
using at least two whole molecule parameters, where the first parameter is 
molecular weight and the second is total charge, isoelectric point, total dipole 
moment, isotropic surface area, electronic charge index, or hydrophobicity. 

Referring again to FIG. 8, a test requirement related to the measured 

25 indicia of the biological activity is determined, Block 806. Operations to 
determine the test requirement are performed essentially as described for Blocks 
106 and 304. The biological activity of a second peptide or plurality of peptides 
not within the first set of test compounds can be identified based on the 
relationship between the biological activity and the whole molecule parameter(s) 

30 of the second peptide(s), so as to identify a peptide(s) that is expected to provide 
indicia of the biological activity that satisfies the test requirement. 

In the drawings and specification, there have been disclosed typical 
preferred embodiments of the invention and, although specific terms are 
employed, they are used in a generic and descriptive sense only and not for 
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purposes of limitation, the scope of the invention being set forth in the following 
claims. Moreover, the terminology in the present description relating to graphs, 
plotting lines, determining a relationship, determining a "best fit" line, representing 
compound isomers as a candidate compound, expanding candidate compounds 
into their constituent compound isomers, etc. is intended to include the 
processing of data and parameters internal to a processing unit (e.g., a 
computer) containing memory and not limited to the physical acts of printing or 
plotting lines, curves and graphs. 
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What is Claimed is: 

1. A method of identifying a culture medium component, comprising 
the steps of: 

measuring first indicia of a property of a plurality of first culture media 
which each contains a respective first test compound from within a first test 
5 library; 

determining a relationship between at least one parameter of the first test 
compounds and the measured first indicia of the property of the plurality of first 
culture media; 

determining a test requirement relating to the measured first indicia; and 
1 0 identifying a second test library containing a plurality of second test 

compounds as components of a plurality of second culture media which based on 
the relationship are expected to provide second indicia of the property which 
meets the test requirement. 

2. The method of Claim 1 , wherein the plurality of first test compounds 
is selected from the first test library using a space-filling technique; and wherein 
the plurality of second test compounds is selected from the second test library 
using a space-filling technique. 

3. The method of Claim 1 , wherein said step of determining a 
relationship comprises the step of determining y , = f(Xij), where Xy- denotes a 
parameter, i ranges from 1 to n where n represents the number of first culture 
media in the plurality thereof, j ranges from 1 to d where d represents the number 

5 of parameters, and y , represents an estimate of the measured first indicia of the 
property of the plurality of first culture media. 

4. The method of Claim 3, wherein said step of determining a test 
requirement comprises the step of determining a range of acceptable indicia of 
the property. 

5. The method of Claim 4, wherein said identifying step comprises 
determining from y , = f(Xij) estimated indicia of the property of a plurality of 
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second culture media which each contains a respective test compound; and 
wherein at least one of the second culture media contains a test compound that 
5 is not within the first test library. 

6. The method of Claim 5, wherein said identifying step further 
comprises: 

determining which of the estimated indicia are within the range of 
acceptable indicia; and 
5 determining from the estimated indicia that are within the range, the 

plurality of second test compounds from the second test library. 

7. The method of Claim 6, further comprising the step of measuring 
second indicia of the property of the plurality of second culture media. 

8. The method of Claim 7, wherein f(Xij) is a non-parametric regression 
formula. 

9. The method of Claim 3, wherein f(Xij) is a non-parametric regression 
formula. 

10. The method of Claim 1 , wherein said step of determining a 
relationship comprises the step of: 

determining a distance function d(xi, X2) between a first value of a 
parameter, xi, of a first test compound and a second value of the parameter, xz, 
5 of a second test compound not within the plurality of first test compounds; and 
estimating indicia of the property of a culture medium containing the 
second test compound as the indicia of the property of the culture medium 
containing the first test compound if d(xi, X2) < dcutofn, where dcutoffi is a cutoff 
distance for the first test compound. 

1 1 . The method of Claim 1 , wherein said measuring step is preceded 
by a step of defining a first test library by representing each of a plurality of 
groups of compound isomers from a compound space as a respective candidate 
compound. 
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12. The method of Claim 1 1 , further comprising the step of expanding 
less than all of the candidate compounds determined in said representing step 
into their constituent compound isomers using a space-filling technique. 

1 3. The method of Claim 1 , wherein the at least one parameter is 
selected from the group consisting of whole molecule, sequence-specific, and 
topological parameters. 

14. The method of Claim 1 , wherein the at least one parameter is a 
whole molecule parameter. 

15. The method of Claim 14, wherein the whole molecule parameter is 
selected from the group consisting of total charge, molecular weight, isoelectric 
point, total dipole moment, isotropic surface area, electronic charge index, and 
hydrophobicity. 

16. A culture medium component identified by the method of Claim 1 . 

17. A culture medium comprising the culture medium component of 
Claim 16. 

18. The method of Claim 1 , wherein the first and second test libraries 
are selected from the group consisting of peptide, polynucleotide, nucleic acid, 
carbohydrate, free fatty acid, and lipid libraries. 

1 9. The method of Claim 1 , wherein the first and second test libraries 
are test peptide libraries. 

20. The method of Claim 19, wherein the first and second test peptide 
libraries consist of peptides having a length in a range from about four amino 
acids to about twenty amino acids. 
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21 . The method of Claim 19, wherein the peptides in the first test library 
comprise at least one amino acid position that is nonvariable or is designated by 
a limited number of possible amino acids. 

22. The method of Claim 19, wherein the peptides in the second test 
library comprise at least one amino acid position that is nonvariable or is 
designated by a limited number of possible amino acids. 

23. The method of Claim 1 , wherein said step of measuring first indicia 
is preceded by the step of forming a plurality of cell cultures which each contains 
a respective first culture medium from within the plurality thereof. 

24. The method of Claim 23 further comprising the step of conditioning 
the cell cultures to grow in both chemically-undefined and chemically-defined 
media prior to said measuring step. 

25. The method of Claim 23, wherein the cell cultures are selected from 
the group consisting of mammalian, insect, plant, fungal, yeast, protozoan and 
bacterial cell cultures. 

26. The method of Claim 23, wherein the plurality of culture media are 
chemically-defined culture media. 

27. The method of Claim 23, wherein the measured property of the 
plurality of first culture media is the ability to alter the growth, proliferation, 
maturation or differentiation of cultured cells. 

28. The method of Claim 23, wherein the measured property of the 
plurality of first culture media is the ability to alter peptide or protein production by 
cultured cells. 

29. The method of Claim 28, wherein the peptide or protein is selected 
from the group consisting of antigens, toxins, antibodies, hormones, growth 
factors, cytokines, clotting factors, and enzymes. 



56 



30. The method of Claim 23, wherein the measured property of the 
plurality of first culture media is the ability to alter the production of a compound 
selected from the group consisting of antibiotics, steroids, carbohydrates lipids 
and nucleic acids by cultured cells. 

31 . A method of defining a test compound library, comprising the step 
of representing each of a plurality of groups of compound isomers from within a 
compound space as a respective candidate compound. 

32. The method of Claim 31 , wherein said representing step is followed 
by the steps of: 

selecting less than all of the candidate compounds using a space-filling 
technique; and 

expanding the less than all of the candidate compounds determined in 
said selecting step into their constituent compound isomers. 

33. The method of Claim 32, further comprising the step of selecting at 
least one constituent compound isomer for each of the candidate compounds. 

34. The method of Claim 31 , wherein the compound space and test 
compound library consist of compounds selected from the group consisting of 
peptides, polynucleotides, nucleic acids, carbohydrates, free fatty acids, and 
lipids. 

35. The method of Claim 31 , wherein the compound space is a peptide 
space and the test compound library is a test peptide library. 

36. A test compound library wherein each of a plurality of compound 
isomers from within a compound space is represented as a respective candidate 
compound. 

37. The test compound library of Claim 36, wherein the compound 
space is a peptide space and the test compound library is a test peptide library. 
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38. A test compound library formed by the method of Claim 31 . 

39. A test compound library formed by the method of Claim 32. 

40. The test compound library of Claim 39, wherein the compound 
space is a peptide space and the test compound library is a test peptide library. 

41 . A method of identifying a culture medium component, comprising 
the steps of: 

defining a first test library by representing each of a plurality of groups of 
compound isomers from within a compound space as a respective candidate 
compound; 

measuring first indicia of a property of a plurality of first culture media 
which each contains a respective first test compound from within the first test 
library; 

determining a relationship between at least one parameter of the first test 
compounds and the measured first indicia of the property of the plurality of first 
culture media; 

determining a test requirement relating to the measured first indicia; and 
identifying a plurality of second test compounds from within the compound 
space as components of a plurality of second culture media which based on the 
relationship are expected to provide second indicia of the property which meets 
the test requirement. 

42. The method of Claim 41 , wherein said defining step further 
comprises expanding the less than all of the candidate compounds determined in 
said representing step into their constituent compound isomers using a space- 
filling technique. 

43. The method of Claim 42, wherein said defining step further 
comprises selecting at least one constituent compound isomer from each of the 
candidate compounds. 
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44. The method of Claim 41 , wherein the plurality of first test 
compounds is selected from the first test library using a space-filling technique. 

45. The method of Claim 41 , wherein said step of determining a 
relationship comprises the step of determining y , = f(x,j), where xy denotes a 
parameter, i ranges from 1 to n where n represents the number of first culture 
media in the plurality thereof, j ranges from 1 to d where d represents the number 

5 of parameters, and y, represents an estimate of the measured first indicia of the 
property of the plurality of first culture media. 

46. The method of Claim 45, wherein said step of determining a test 
requirement comprises the step of determining a range of acceptable indicia of 
the property. 

47. The method of Claim 46, wherein said identifying step comprises 
determining from y, = f(Xjj) estimated indicia of the property of a plurality of 
second culture media which each contains a respective test compound, wherein 

5 at least one of the second culture media contains a test compound that is not 
within the first test library. 

48. The method of Claim 47, wherein said identifying step further 
comprises: 

determining which of the estimated indicia are within the range of 
acceptable indicia; and 
5 determining from the estimated indicia that are within the range, the 

plurality of second test compounds from the compound space. 

49. The method of Claim 45, wherein f(Xij) is a non-parametric 
regression formula. 

50. The method of Claim 41 , wherein said step of determining a 
relationship comprises the step of: 
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determining a distance function d(xi, X2) between a first value of a 
parameter, Xi , of a first test compound and a second value of the parameter, X2, 
of a second test compound not within the plurality of first test compounds; and 

estimating indicia of the property of a culture medium containing the 
second test compound as the indicia of the property of the culture medium 
containing the first test compound if d(xi, X2) < dcutotn, where dcutoffi is a cutoff 
distance for the first test compound. 

51 . The method of Claim 41 , wherein the at least one parameter is 
selected from the group consisting of whole molecule, sequence-specific, and 
topological parameters. 

52. The method of Claim 41 , wherein the at least one parameter is a 
whole molecule parameter. 

53. The method of Claim 41 , wherein the compound space is selected 
from the group consisting of peptide, polynucleotide, nucleic acid, carbohydrate, 
free fatty acid, and lipid spaces. 

54. The method of Claim 41 , wherein the compound space is a peptide 

space. 

55. A culture medium component identified by the method of Claim 41 . 

56. A culture medium comprising the culture medium component of 
Claim 55. 

57. The method of Claim 41 , wherein said step of measuring first indicia 
of the property comprises adding each of the plurality of first culture media to a 
respective cell culture to form a plurality of cell cultures each containing a 
respective culture medium containing a respective first test compound. 
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58. The method of Claim 57, wherein the plurality of cell cultures is 
selected from the group consisting of mammalian, insect, plant, fungal, yeast, 
protozoan and bacterial cell cultures. 

59. A method of predicting indicia of a property of a peptide, comprising 
the steps of: 

measuring indicia of an activity of a plurality of test peptides from a test 
peptide library; 

determining a relationship between the measured indicia of the activity and 
at least one whole molecule parameter of the plurality of test peptides; 

predicting the indicia of the activity of a peptide not within the plurality of 
test peptides based on the relationship. 

60. The method of Claim 59, wherein the first plurality of test peptides 
from the test peptide library is selected using a space-filling technique. 

61 . The method of Claim 59, wherein the at least one whole molecule 
parameter comprises a parameter selected from the group consisting of total 
charge, molecular weight, isoelectric point, total dipole moment, isotropic surface 
area, electronic charge index, and hydrophobicity. 

62. The method of Claim 59, wherein at least two whole molecule 
parameters of the plurality of test peptides are selected from the group consisting 
of total charge, molecular weight, isoelectric point, total dipole moment, isotropic 
surface area, electronic charge index, and hydrophobicity. 

63. The method of Claim 59, wherein the at least one whole molecule 
parameter comprises hydrophobicity, molecular weight, total dipole moment, and 
total charge. 

64. The method of Claim 59, wherein the at least one whole molecule 
parameter comprises molecular weight and at least one additional parameter 
selected from the group consisting of total charge, isoelectric point, total dipole 
moment, isotropic surface area, electronic charge index, and hydrophobicity. 
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65. The method of Claim 59, wherein the activity is binding to a 
receptor. 

66. The method of Claim 59, wherein the activity is enhancement or 
inducement of a biological activity in a cell. 

67. The method of Claim 59, wherein the activity is inhibition or 
prevention of a biological activity in a cell. 

68. The method of Claim 66 or Claim 67, wherein the cell is a cell 
cultured in vitro. 

69. The method of Claim 68, wherein said step of measuring indicia of 
the property comprises: 

forming a plurality of culture media that each contains a respective test 
peptide from the plurality thereof; and 

adding each of the plurality of culture media to a respective cell culture to 
form a plurality of cell cultures each containing a respective culture medium 
containing a respective test compound. 

70. The method of Claim 59, wherein the activity is enhancement or 
inhibition of a receptor. 

71 . The method of Claim 59, wherein the activity is enhancement or 
inducement of activation of a receptor. 

72. The method of Claim 59, wherein the test peptide library consists of 
peptides having a length in a range from about four amino acids to about twenty 
amino acids. 

73. The method of Claim 59, wherein the test peptide library consists of 
peptides having a length in a range from about four amino acids to about ten 
amino acids. 
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74. A method of identifying a peptide with a predicted indicia of an 
activity that satisfies a test requirement, comprising the steps of: 

measuring first indicia of an activity of a plurality of first test peptides from 
a first test peptide library; 
5 determining a relationship between the measured first indicia of the activity 

and at least one whole molecule parameter of the plurality of first test peptides; 
determining a test requirement relating to the measured first indicia; and 
identifying a second test peptide library containing a plurality of second 
test peptides which based on the relationship are expected to provide second 
1 0 indicia of the activity that meets the test requirement. 

75. The method of Claim 74, wherein the plurality of first test peptides is 
selected from the first test peptide library using a space-filling technique. 

76. The method of Claim 74, wherein said step of determining a 
relationship comprises the step of determining y , = f(Xij), where xy denotes a 
parameter, i ranges from 1 to n where n represents the number of first test 
peptides in the plurality thereof, j ranges from 1 to d where d represents the 

5 number of whole molecule parameters, and y, represents an estimate of the 
measured first indicia of the activity of the plurality of first test peptides. 

77. The method of Claim 76, wherein said step of determining a test 
requirement comprises the step of determining a range of acceptable indicia of 
the activity. 

78. The method of Claim 77, wherein said identifying step further 
comprises: 

determining which of the estimated indicia are within the range of 
acceptable indicia; and 
5 determining from the estimated indicia that are within the range, the 

plurality of second test peptides from the second test peptide library. 



63 



79. The method of Claim 76, wherein f(Xij) is a non-parametric 
regression formula. 

80. The method of Claim 74, wherein said step of determining a 
relationship comprises the step of: 

determining a distance function d(xi, X2) between a first value of a whole 
molecule parameter, xi, of a first test peptide and a second value of the whole 
molecule parameter, X2, of a second test peptide not within the first test peptide 
library; and 

estimating indicia of the activity of the second test peptide as the indicia of 
the activity of the first test peptide if d(xi, X2) < dcutoffi, where dcutoffi is a cutoff 
distance for the first test peptide. 

81. The method of Claim 74, wherein said measuring step is preceded 
by the step of defining a first test peptide library by representing each of a 
plurality of groups of peptide isomers from a first peptide space as a respective 
candidate peptide. 

82. The method of Claim 81 , further comprising the step of expanding 
less than all of the candidate peptides determined in said representing step into 
their constituent compound peptides using a space-filling technique. 

83. The method of Claim 74, wherein the at least one whole molecule 
parameter is selected from the group consisting of total charge, molecular weight, 
isoelectric point, total dipole moment, isotropic surface area, electronic charge 
index, and hydrophobicity. 

84. The method of Claim 74, wherein at least two whole molecule 
parameters are selected from the group consisting of total charge, molecular 
weight, isoelectric point, total dipole moment, isotropic surface area, electronic 
charge index, and hydrophobicity. 
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85. The method of Claim 74, wherein the at least one parameter 
comprise hydrophobicity, molecular weight, total dipole moment, and total 
charge. 

86. The method of Claim 74, wherein the at least one whole molecule 
parameter is molecular weight and at least one additional parameter selected 
from the group consisting of total charge, isoelectric point, total dipole moment, 
isotropic surface area, electronic charge index, and hydrophobicity. 

87. The method of Claim 74, wherein the activity is binding to a 
receptor. 

88. The method of Claim 74, wherein the activity is enhancement or 
inducement of a biological activity in a cell. 

89. The method of Claim 74, wherein the activity is inhibition or 
prevention of a biological activity in a cell. 

90. The method of Claim 88 or Claim 89, wherein the cell is a cell 
cultured in vitro. 

91 . The method of Claim 90, wherein said step of measuring first indicia 
of the activity comprises: 

forming a plurality of culture media that each contains a respective test 
peptide from the plurality thereof; and 

adding each of the plurality of culture media to a respective cell culture to 
form a plurality of cell cultures each containing a respective culture medium 
containing a respective first test compound. 

92. The method of Claim 74, wherein the activity is inhibition or 
prevention of activation of a receptor. 

93. The method of Claim 74, wherein the activity is enhancement or 
inducement of activation of a receptor. 
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94. The method of Claim 74, wherein the test peptide library consists of 
peptides having a length in a range from about four amino acids to about twenty 
amino acids. 

95. The method of Claim 74, wherein the test peptide library consists of 
peptides having a length in a range from about four amino acids to about ten 
amino acids. 

96. A method of identifying a culture medium component, comprising 
the steps of: 

culturing a plurality of first cell cultures in a plurality of first culture media 
5 each containing a respective first test compound from a first test library; 

measuring first indicia of a property of the plurality of first culture media in 
the plurality of first cell cultures; 

determining a relationship between the measured first indicia of the 
property and at least one parameter of the plurality of first culture media; 
10 determining a test requirement relating to the measured first indicia; and 

culturing a plurality of second cell cultures in a plurality of second culture 
media each containing a respective second test compound from a second test 
library; 

wherein based on the relationship the plurality of second culture media 
1 5 containing the second test compounds are predicted to give indicia of the 
property that satisfy the test requirement. 

97. The method of Claim 96, further comprising the step of 
reformulating the culture medium containing the identified culture medium 
component to omit components. 

98. The method of Claim 97, wherein at least one component is omitted 
from the culture medium formulation. 

99. The method of Claim 96, wherein said step of culturing a plurality of 
first ceil cultures further comprises the steps of: 
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culturing the plurality of first cell cultures in a chemically-defined culture 
medium and a chemically-undefined culture medium each containing a test 
compound from the first test library; and 

comparing the measured indicia of the property for the same test 
compound in the chemically-defined culture medium and the chemically- 
undefined culture medium. 

1 00. The method of Claim 96, wherein said measuring step is preceded 
by the step of conditioning the cell cultures to grow in both chemically-undefined 
and chemically-defined media. 

101 . The method of Claim 96, wherein the plurality of first and second 
culture media comprise a concentration of an undefined protein component in a 
range from about 0.1% (w/v) to about 2.5%(w/v). 

102. The method of Claim 101, wherein the undefined protein 
component is selected from the group consisting of hydrolysates, digests, 
extracts, and infusions. 

1 03. The method of Claim 96, wherein the plurality of first and second 
culture media comprise a concentration of an undefined protein component in a 
range from about 0.25% (w/v) to about 1%(w/v). 

104. The method of Claim 96, wherein the plurality of first and second 
culture media comprise a concentration of serum in a range from about 0.05% 
(v/v) to about 30%(v/v). 

1 05. The method of Claim 96, wherein the plurality of first and second 
cell cultures is selected from the group consisting of mammalian, insect, plant, 
fungal, yeast, protozoan and bacterial cell cultures. 

106. The method of Claim 96, wherein the plurality of first and second 
culture media are chemically-defined culture media. 
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1 07. The method of Claim 96, wherein the plurality of first and second 
culture media are liquid culture media. 

108. The method of Claim 96, wherein the measured property of the 
plurality of first culture media is the ability to alter growth, maturation, 
proliferation, or differentiation of cultured cells. 

1 09. The method of Claim 96, wherein the measured property of the 
plurality of first culture media is the ability to alter peptide or protein synthesis by 
cultured cells. 

1 1 0. The method of Claim 109, wherein the peptide or protein is selected 
from the group consisting of antigens, toxins, antibodies, hormones, growth 
factors, cytokines, clotting factors, and enzymes. 

111. The method of Claim 96, wherein the measured property of the 
plurality of first culture media is the ability to alter the synthesis of a compound 
selected from the group consisting of antibiotics, steroids, carbohydrates, lipids 
and nucleic acids by cultured cells. 

112. The method of Claim 96, wherein the measured property of the 
plurality of first culture media is the ability to alter peptide or protein secretion by 
cultured cells. 

113. A culture medium component identified by the method of Claim 96. 

114. A culture medium comprising the culture medium component of 
Claim 113. 

115. The culture medium of Claim 114, wherein the culture medium 
comprises a concentration of an undefined protein component in a range from 
about 0.1% (w/v) to about 2.5%(w/v). 
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116. The culture medium of Claim 115, wherein the undefined protein 
component is selected from the group consisting of hydrolysates, digests, 
extracts and infusions. 

117. The culture medium of Claim 114, wherein the culture medium 
comprises a concentration of serum in a range from about 0.05% (v/v) to about 
30%(v/v). 

118. The culture medium of Claim 1 14, wherein the culture medium 
comprises insulin. 

119. An apparatus for identifying a culture medium component, 
comprising: 

means for determining a relationship between measured first indicia of a 
5 property of a plurality of first culture media which each contains a respective first 
test compound from within a first test library and at least one parameter of the 
first test compounds; and 

means for identifying a second test library containing a plurality of second 
test compounds as components of a plurality of second culture media which 
10 based on the relationship are expected to provide second indicia of the property 
which meets a test requirement relating to the measured first indicia. 

120. The apparatus of Claim 119, wherein said determining means 
comprises means for determining y , = f(Xij), where Xy denotes a parameter, i 
ranges from 1 to n where n represents the number of first culture media in the 
plurality thereof, j ranges from 1 to d where d represents the number of 

5 parameters, and y ^ represents an estimate of the measured first indicia of the 
property of the plurality of first culture media. 

121 . The apparatus of Claim 120, wherein said identifying means 
comprises means for determining from y , = f(Xij) estimated indicia of the property 
of a plurality of second culture media which each contains a respective test 
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compound, wherein at least one of the second culture media contains a test 
5 compound that is not within the first test library. 

122. The apparatus of Claim 119, wherein said determining means 
comprises: 

means for determining a distance function d(xi, X2) between a first value of 
5 a parameter, Xi, of a first test compound and a second value of a parameter, X2, 
of a second test compound not within the plurality of first test compounds; and 

means for estimating indicia of the property of a culture medium containing 
the second test compound as the indicia of the property of the culture medium 
containing the first test compound if d(xi, X2) < dcutoffi, where dcutoffi is a cutoff 
1 0 distance for the first test compound. 

123. A computer program product readable by a machine and tangibly 
embodying a program of instructions executable by the machine to perform the 
method steps of: 

determining a relationship between measured first indicia of a property of a 
5 plurality of first culture media which each contains a respective first test 

compound from within a first test library and at least one parameter of the first 
test compounds ; and 

identifying a second test library containing a plurality of second test 
compounds as components of a plurality of second culture media which based on 
1 0 the relationship are expected to provide second indicia of the property which 
meets a test requirement relating to the measured first indicia. 

124. The computer program product of Claim 123, wherein said 
determining step further comprises determining y i = f(Xij), where Xjj denotes a 
parameter, i ranges from 1 to n where n represents the number of first culture 
media from within a plurality thereof, j ranges from 1 to d where d represents the 

5 number of parameters, and y , represents an estimate of the measured first 
indicia of the property of the first test compounds. 
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125. The computer program product of Claim 124, wherein said 
identifying step further comprises determining from 9; = f(Xij) estimated indicia of 
the property of a plurality of second culture media which each contains a 
respective test compound, wherein at least one of the second culture media 

5 contains a test compound that is not within the first test library. 

126. The computer program product of Claim 125, wherein said 
determining step further comprises: 

determining a distance function d(xi, X2) between a first value of a 
parameter, Xi, of a first test compound and a second value of a parameter, X2, of 
5 a second test compound not within the plurality of first test compounds; and 

estimating indicia of the property of a culture medium containing the 
second test compound as the indicia of the property of the culture medium 
containing the first test compound if d(xi, X2) < dcutoffi, where dcutoffi is a cutoff 
distance for the first test compound. 

127. The method of Claim 97, wherein a vitamin is omitted from the 
culture medium formulation. 
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METHODS, APPARATUS AND COMPUTER PROGRAM 
PRODUCTS FOR FORMULATING CULTURE MEDIA 



Abstract of the Disclosure 

5 Methods, apparatus and computer program products are provided for 

identifying a component of culture medium based on the parameters (e.g., 
physical, chemical, biological and/or topological characteristics) of compounds 
from within a compound library. In preferred embodiments, the compound library 
is a peptide library. Also provided are methods of selecting a compound library 

10 from a larger compound space based on whole molecule (/.e., global) parameters 
of the compounds. Preferably, this method is practiced in conjunction with a 
method of identifying a component of a culture medium. Further provided are 
methods of predicting a biological activity of a peptide based on at least one 
whole molecule parameter of the peptide. This method finds use in methods of 

15 drug discovery, identifying components of culture medium, and identifying and/or 
designing peptides with particular pharmacological or therapeutic activities. 



72 



SEQUENCE LISTING 



<110> Haaland/ Perry D. 

Sherman, Douglas B. 
Stewart II, Walter W. 
Lloyd, Sheila A. 
Campbell, Robert L. 
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<140> 
<141> 

<160> 47 

<170> PatentIn Ver . 2.0 

<210> 1 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 1 

Gly Ala Leu Gly 
1 



<210> 2 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 2 

Gin Gly Val Glu 
1 



<210> 3 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 3 

Ser Ala Pro Val 
1 
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<210> 4 
<211> 4 
<212> PRT 

<213> Artificial Sequence 

<220> 

<22 3> Description of Artificial Sequence: hypothetical 
peptide 

<400> 4 

Ser Pro Ala Gin 
1 



<210> 5 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 5 

Glu Glu Val Phe 

1 



<210> 6 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 6 

Val Leu Ser Lys 
1 



<210> 7 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 7 

Val Ser Glu Leu 
1 



<210> 8 
<211> 4 
<212> PRT 
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<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 8 

Pro Phe Glu Pro 
1 



<210> 9 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 9 

Glu Leu Gin Glu 
1 



<210> 10 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 10 
Lys Val Gin Phe 
1 



<210> 11 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 11 
Gly Lys Ala Pro 
1 



<210> 12 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 
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<400> 12 
Ala Gin Lys Ser 
1 



<210> 13 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 13 
Ala Gin Gly Glu 
1 



<210> 14 
<211> 4 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 14 
Lys Glu Phe Gly 
1 



<210> 15 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 15 
Pro Ser Phe Lys 
1 



<210> 16 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 16 
Phe Ser Leu Ala 
1 
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<210> 17 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 17 
Leu Phe Gly Ala 
1 



<210> 18 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 18 
Glu Val Lys Ser 
1 



<210> 19 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 19 
Val Gly Glu Ala 
1 



<210> 20 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 20 
Gin Glu Ser Gin 
1 



<210> 21 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<22 3> Description of Artificial Sequence: hypothetical 
peptide 

<400> 21 
Gly Ala Pro Val 
1 



<210> 22 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 22 
Ser Ala Leu Gly 
1 



<210> 23 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 23 
Asp Lys Ala His 
1 



<210> 24 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 24 
Asp Trp Pro Ala 
1 



<210> 25 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 
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<400> 25 
Glu Ser Met His 
1 



<210> 26 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 26 
Gly Val Asn Glu 
1 



<210> 27 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 27 
His Glu Asp Val 
1 



<210> 28 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 28 
Glu Thr Gly Ser 
1 



<210> 29 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 29 
His Tyr Gly Val 
1 
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<210> 30 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 30 
Asp Phe Gly Val 
1 



<210> 31 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 31 
His Tyr Pro Val 
1 



<210> 32 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 32 
Ala Ala Ala Ala 
1 



<210> 33 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 33 
Ala Ala Ala Cys 
1 



<210> 34 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 34 
Ala Ala Cys Ala 
1 



<210> 35 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 35 
Ala Cys Ala Ala 
1 



<210> 36 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 36 
Cys Ala Ala Ala 
1 



<210> 37 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 37 
Ala Ala Cys Cys 
1 



<210> 38 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 38 
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Ala Cys Ala Cys 
1 



<210> 39 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 39 
Cys Ala Ala Cys 
1 



<210> 40 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 40 
Ala Cys Cys Ala 
1 



<210> 41 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 41 
Cys Ala Cys Ala 
1 



<210> 42 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 42 
Cys Cys Ala Ala 
1 



<210> 43 



82 



<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 43 
Ala Cys Cys Cys 
1 



<210> 44 

<211> 4 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 44 
Cys Ala Cys Cys 
1 



<210> 45 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 45 
Cys Cys Ala Cys 
1 



<210> 46 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 46 
Cys Cys Cys Ala 
1 



<210> 47 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: hypothetical 
peptide 

<400> 47 
Cys Cys Cys Cys 
1 
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Identification of a culture medium component based ont 
parameters of medium components 



MEASURE FIRST INDICIA OF A PROPERTY OF A FIRST PLURALITY 

OF CULTURE MEDIA WHICH EACH CONTAINS A FIRST TEST 
COMPOUND CHOSEN FROM A FIRST TEST LIBRARY BASED ON A 
SPACE-FILLING TECHNIQUE 



DETERMINE A RELATIONSHIP BETWEEN AT LEAST ONE 
PARAMETER OF THE FIRST TEST COMPOUNDS AND THE 
MEASURED INDICIA OF THE PROPERTY, WHERE THE 
RELATIONSHIP IS REPRESENTED BY y.= f(Xij), WHERE Xi, DENOTES 
A PARAMETER, i RANGES FROM 1 TO n WHERE n REPRESENTS 

THE NUMBER OF FIRST CULTURE MEDIA IN THE PLURALITY 
THEREOF, j RANGES FROM 1 TO d WHERE d REPRESENTS THE 
NUMBER OF PARAMETERS, AND y, REPRESENTS AN ESTIMATE OF 
THE MEASURED FIRST INDICIA OF THE PROPERTY OF THE 
PLURALITY OF FIRST CULTURE MEDIA 



■0I 



DETERMINE A TEST REQUIREMENT RELATING TO 
THE MEASURED FIRST INDICIA 



0 



BASED ON THE RELATIONSHIP BETWEEN THE MEASURED FIRST 

INDICIA AND AT LEAST ONE PARAMETER OF THE FIRST TEST 
COMPOUNDS, IDENTIFY A SECOND TEST LIBRARY CONTAINING A 
PLURALITY OF SECOND TEST COMPOUNDS AS COMPONENTS OF 
A PLURALITY OF SECOND CULTURE MEDIA WHICH ARE EACH 
PREDICTED TO PROVIDE INDICIA OF THE PROPERTY THAT 
SATISFIES THE TEST REQUIREMENT 
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IDENTIFICATION OF A CULTURE MEDIUM COMPONENT BASED ON^ 
PARAMETERS OF MEDIUM COMPONENTS 



MEASURE FIRST INDICIA OF A PROPERTY OF A FIRST PLURALITY 

OF CULTURE MEDIA WHICH EACH CONTAINS A FIRST TEST 
COMPOUND CHOSEN FROM A FIRST TEST LIBRARY BASED ON A 
SPACE-FILLING TECHNIQUE 



2,0 



DETERMINE A TEST REQUIREMENT RELATING TO 
THE MEASURED FIRST INDICIA 



joy 



DETERMINE A RELATIONSHIP BETWEEN AT LEAST ONE 
PARAMETER OF THE FIRST TEST COMPOUNDS AND THE 
MEASURED INDICIA OF THE PROPERTY, WHERE THE 
RELATIONSHIP IS REPRESENTED BY A DISTANCE FUNCTION 
d(xi,X2) BETWEEN A FIRST VALUE OF A PARAMETER, Xi, OF A FIRST 
TEST COMPOUND AND A SECOND VALUE OF THE PARAMETER, X2, 
OF A SECOND TEST COMPOUND NOT WITHIN THE PLURALITY OF 

FIRST TEST COMPOUNDS 



ESTIMATING INDICIA OF THE PROPERTY OF A CULTURE MEDIUM 
CONTAINING THE ADDITIONAL TEST COMPOUND AS THE INDICIA 
OF THE PROPERTY OF THE CULTURE MEDIUM CONTAINING THE 
FIRST TEST COMPOUND IF d(xi, X2)^ d<;ut«ff, WHERE dcutoff IS A CUTOFF 

DISTANCE 



BASED ON THE RELATIONSHIP BETWEEN THE MEASURED FIRST 

INDICIA AND AT LEAST ONE PARAMETER OF THE FIRST TEST 
COMPOUNDS, IDENTIFY A SECOND TEST LIBRARY CONTAINING A 
PLURALITY OF SECOND TEST COMPOUNDS AS COMPONENTS OF 
A PLURALITY OF SECOND CULTURE MEDIA WHICH ARE EACH 
PREDICTED TO PROVIDE INDICIA OF THE PROPERTY THAT 
SATISFIES THE TEST REQUIREMENT 



FIG. 3 
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C FORMATION OF A REPRESENTATIVE COMPOUND LIBRARY BASED ON 
WHOLE MOLECULE PARAMETERS 
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REPRESENT EACH OF A PLURALITY OF GROUPS OF COMPOUND ISOMERS 
FROM WITHIN A COMPOUND SPACE AS A RESPECTIVE CANDIDATE 
COMPOUND BASED ON AT LEAST ONE SHARED GLOBAL PARAMETER 










EXPAND LESS THAN ALL OF THE CANDIDATE COMPOUNDS INTO THEIR 
CONSTITUENT COMPOUND ISOMERS USING A SPACE-FILLING DESIGN 








DEFINE A TEST LIBRARY CONTAINING LESS THAN ALL OF THE EXPANDED 

COMPOUND ISOMERS 





FIG. 6 



IDENTIFICATION OF A CULTURE MEDIUM 
COMPONENT BASED ON 
PARAMETERS OF THE MEDIUM COMPONENTS 




OF THE CANDIDATE COMPOUNDS 




EXPAND THE SELECTED CANDIDATE COMPOUND 
BY EXPANDING INTO INDIVIDUAL COMPOUND ISOMERS 




FIG. 7 




PREDICTION OF THE BIOLOGICAL ACTIVITY OF A PEPTIDE BASED ON 
WHOLE MOLECULE PARAMETERS OF THE PEPTIDE 




MEASURING INDICIA OF A BIOLOGICAL ACTIVITY OF A PLURALITY OF TEST 
PEPTIDES CHOSEN FROM A FIRST TEST PEPTIDE LIBRARY BASED ON A 

SPACE-FILLING DESIGN 



DETERMINING A RELATIONSHIP BETWEEN THE MEASURED INDICIA OF THE 
BIOLOGICAL ACTIVITY AND AT LEAST ONE WHOLE MOLECULE PARAMETER OF 

THE PLURALITY OF TEST PEPTIDES 



PREDICTING THE INDICIA OF THE BIOLOGICAL ACTIVITY OF A PEPTIDE NOT 
WITHIN THE PLURALITY OF TEST PEPTIDES BASED ON THE RELATIONSHIP 
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DECLARATION AND POWER OF ATTORNEY FOR PATENT APPLICATION 

Attorney Docket No. P3250 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name. 

I believe I am the original, first and sole inventor (if only one name is hsted below) or an 
original, first and joint inventor (if plural names are listed below) of the subject matter which 
is claimed and for which a patent is sought on the invention entitled METHODS^ 
APPARATUS AND COMPUTER PROGRAM PRODUCTS FOR FORMULATING 
CULTURE MEDIA, 

the specification of which 
IXI is attached hereto 
OR 

□ was filed on as United States Application No. or PCX Intemational Application 

Number and was amended on (if applicable). 

I hereby state that I have reviewed and understand the contents of the above-identified 
specification, including the claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose information which is material to patentability as defined in 
Title 37 Code of Federal Regulations, §1.56. 

I hereby claim foreign priority benefits under Title 35, United States Code, § 119(a)-(d) or 
§ 365(b) of any foreign apphcation(s) for patent or inventor's certificate, or § 365(a) of any 
PCT Intemational application which designated at least one country other than the United 
States of America, listed below and have also identified below any foreign application for 
patent or inventor's certificate, or of any PCT Intemational application having a filing date 
before that of the application on which priority is claimed. 









□ Yes □ No 


Number 


Country 


MM/DD/YYYY Filed 


Priority Claimed 














Number 


Country 


MM/DD/YYYY Filed 


□ Yes □ T^Tu 

Priority Claimed 












Number 


Coimtry 


MM/DD/YYYY Filed 


□ Yi.s □ Nu 

Priority Claimed 
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I hereby claim the benefit under Title 35, United States Code, § 11 9(e) of any United States 
provisional application(s) listed below. 







Application Number(s) 


Filing Date (MM/DDATYYY) 








Application Number(s) 


Filing Date (MM/DD/YYYY) 



I hereby claim the benefit under Title 35, United States Code, § 120 of any United States 
application(s) or § 365(c) of any PCT international apphcation designating the United States 
of America, hsted below and, insofar as the subject matter of each of the claims of this 
application is not disclosed in the prior United States or PCT International apphcation(s) in 
the manner provided by the first paragraph of Title 35, United States Code, § 1 12, 1 
acknowledge the duty to disclose information which is material to patentability as defined in 
Title 37, Code of Federal Regulations, § 1.56 which became available between the filing date 
of the prior application and the national or PCT international filing date of this apphcation 
(37C.RR. § L63(d)), 









Appln. Serial No. 


Filing Date 


Status 

Patented/Pending/Abandoned 












Appln. Serial No. 


Filing Date 


Status 

Patented/Pending/Abandoned 












Appln. Serial No. 


Filing Date 


Status 

Patented/Pending/Abandoned 



I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the hke so made 
are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United 
States Code and that such willful false statements may jeopardize the vahdity of the 
application or any patent issued thereon. 



POWER OF ATTORNEY: As a named inventor, I hereby appoint the following registered 
attomey(s) to prosecute this application and transact all business in the Patent and Trademark 
Office connected therewith. 



Richard J. Rodrick 


26,985 


Richard E. Brown 


30,934 


Alan W. Fiedler 


33,690 


David W. Highet 


30,265 


Donna R. Fugit 


32,135 


Susan A. Capello 


34,560 


Eric M. Lee 


30,471 


Allen W. Wark 


30,503 


Nanette S. Thomas 


33,310 


Arthur D. Dawson 


35,113 


Bruce S. Weintraub 


34,277 


John L. Voellmicke 


30,348 


Keith J. McWha 


44,235 


Scott S. Servilla 


40,806 


Robert M. Hallenbeck 


31,364 







Send correspondence to: Richard J. Rodrick 

Becton, Dickinson and Company 
1 Becton Drive 

Franklin Lakes, New Jersey 07417 

Direct telephone calls to: Donna R. Fugit, Ph.D. 

(201) 847-7166 
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Full name of inventor: Robert L. Campbell 



Inventor's 
Signature: 




Residence: Bahama, North Carolina 

Citizenship: United States of America 

Post Office Address: 607 Phar Lap Lane 

Bahama, North Carolina 27503 
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Full name of inventor: 



Perry D. Haaland 



Inventor's 
Signature 

Residence: 



: <5. 



Citizenship: 

Post Office Address: 



^/s.^=&v^Z_Date: J t . Hl^ 

Chapel Hill, North CaroHna 

United States of America 

8914 Laurel Springs Drive 
Chapel Hill, North Carolina 27516 
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Full name of inventor: 



Douglas B. Sherman 



Inventor's C) J 9/ „ . 

Signature: A Jk&4^^^^am Date: ^^^\y ij } 

Residence: Durham, North Carolina 

Citizenship: United States of America 

Post Office Address: 7 Lansgate Court 

Durham, North Carolina 27713 



Full name of inventor: Walter William Stewart II 



Inventor's 
Signature: 




Residence: Gary, North Carolina 

Citizenship : United States of America 

Post Office Address: 207 Electra Drive 

Cary, North Carolina 27513 
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Full name of inventor: Sheila A. Lloyd 

Inventor's a /) kJ^ f i I 

Signature: ydjjj/ltj 0. ^^AdU^ Date: 7/7 /^j 

Residence: Gary, North Carolina 

Citizenship : United States of America 

Post Office Address: 119 Flora MacDonald Lane 

Gary, North Carolina 275 1 1 
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